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Preface 



THIS doctoral dissertation is the fruit of five years of full-time research under- 
taken at the Department of Astrophysics and Geophysics of the University 
of Liege on January 1, 1994. Basically, it can be thought of as an extension of 
my MSc thesis, the scope of which was to present in a self-contained fashion the 
Hamiltonian formulation of a specific class of alternative, higher-order theories of 
gravity, namely those relativistic, metric theories based on quadratic curvature La- 
grangians. In 1996 I had the opportunity to stay at the Department of Mathematics 
of the University of the Aegean for two months; the fruitful interplay with my Greek 
collaborators has undoubtedly broadened my research concerns in general relativity 
and cosmology and permeates a significant part of this doctoral dissertation. 

When I started to work on alternative gravity theories, I planned to provide 
the reader of this yet unwritten dissertation with an exhaustive review of higher- 
order theories of gravity. However, I soon realised that the history of these theories 
is fairly intricate, hence quite difficult to summarise; an exhaustive account with 
a wide historical perspective would thereby be — very — lengthy and not suited to 
the present work. Still, I am deeply convinced that understanding the essential 
motivations of our predecessors greatly helps in finding one's path in scientific 
research; but this takes much time, thereby implying less published articles: This 
is not particularly welcomed according to modern standards. Fortunately, during 
the last five years I have never been put under pressure to submit a paper every 
three months or so. 

I wish that the interested reader will find this dissertation helpful and pleasant 
to read . . . 



Cointe, December 1998 L. QuERELLA 
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Abstract 

This dissertation investigates three main topics, all of which dealing with alternative, higher-order gravity 
theories in four dimensions. Firstly, we study the variational and conformal structure of those theories. 
Next, we analyse their Hamiltonian formulation and in particular its relationship with the famous adm 
canonical version of general relativity. Finally, we study higher-order spatially homogeneous cosmologies 
and exemplify how Hamiltonian methods can be utilised to simplify the analysis of the associated field 
equations. 

As regards the first topic, we begin by critically reviewing the variational principle in gravitational 
theory: We argue that the 'Einstein-Palatini', metric-aifine method of variation, aside from being in- 
herently nonequivalent to the Hilbert, purely metric variational principle, leads to inconsistencies when 
applied to generalised gravitational actions including higher-order curvature terms. This conveys us to put 
forth that one possible scheme that does not exhibit the cumbersome features of the 'Einstein-Palatini 
device' is the Lagrange-multiplier version of the latter, namely the constrained first-order formalism. Ap- 
plying this constrained method of variation to a general class of nonlinear Lagrangians we prove that the 
conformal equivalence theorem of these nonlinear theories with general relativity and an additional scalar 
field holds in the extended framework of Weyl geometry. As a direct consequence, we demonstrate that 
the Einstein-Palatini method is a degenerate case of the constrained first-order formalism and that it is 
unable to deal with Weyl spaces. This investigation sheds another light on what is sometimes referred to 
as the universality of Einstein's equations. 

Next, we give a detailed account of the Hamiltonian formulation of higher-order gravity theories. 
After a short summary of Dirac's formalism for constrained systems, we thoroughly analyse the procedure 
that enables one to develop a consistent canonical formulation of any field theory involving higher deriva- 
tives: the generalised Ostrogradsky method. We demonstrate the effectiveness of this modus operandi by 
expressing nonlinear gravitational Lagrangians in canonical form. This conveys us to the main result in 
this part, that is, the equivalence of the Ostrogradsky and adm canonical versions of general relativity. 
We then discuss the issue of boundary terms in the light of the Ostrogradsky formalism. We finally obtain 
the explicit forms of the Hamiltonian constraints and canonical equations derived from generic quadratic 
Lagrangians. 

The last topic is devoted to the study of spatially homogeneous Bianchi-type cosmologies in higher- 
order gravity. We firstly analyse the empty Bianchi-type IX or mixmaster model in the full fourth-order 
gravity theory — without resorting to the Ostrogradsky scheme — on approach to the initial singularity; 
we prove that the mixmaster chaotic behaviour based on the bkl piecewise approximation method is 
structurally unstable and that there exists an isotropic power-law solution reaching the initial singularity 
in a stable, monotonic way. Next, we particularise the aforementioned Ostrogradsky canonical formalism 
to class A Bianchi types in the two distinct variants of the generic quadratic theory, namely the pure 
'-R-squared' case and the conformally invariant, 'Weyl-squared' case respectively. In the former we reduce 
the system of canonical equations to a system of autonomous second-order coupled differential equations 
that we solve analytically for Bianchi type I. In the latter we prove that the Bianchi-type I system is not 
integrable — in the sense of Painleve — ; we determine all particular closed-form solutions and discuss their 
conformal relationship with Einstein spaces. 
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aommaire 

Dans cette these, nous etudions divers aspects lies aux theories alternatives de la gravitation contenant 
des termes d'ordre superieur en la courbure, dans des espaces-temps a quatre dimensions. Nous analysons 
successivement la structure variationnelle et conforme de ces theories ainsi que leur formulation hamil- 
tonienne. Ensuite, nous procedons a I'etude de modeles spatialement homogtoes et, en particulier, nous 
montrons comment le formalisme canonique pent conduire a une simplification dans la recherche de solu- 
tions exactes. 

Dans la premiere partie, apres avoir insiste sur le fait que le principe variationel d'Einstein-Palatini 
n'est, en general, pas equivalent au principe de Hilbert et qu'il presente des incoherences manifestes des 
qu'il est applique a des theories autres que la relativite generale dans le vide, nous avangons qu'un schema 
consistant ne souffrant pas de ces difBcultes est fourni par la methode de variation metrique-afBne avec 
multiplicateurs de Lagrange. Nous demontrons que I'equivalence conforme des theories non lineaires de la 
gravitation avec la relativite generale et un champ scalaire additionnel est egalement verifiee dans le cadre 
geometrique etendu des espaces de Weyl. Comme coroUaire, nous prouvons que la methode d'Einstein- 
Palatini est un cas degenere du formalisme avec contraintes et qu'elle ne permet pas de travailler en 
geometric de Weyl. Sous cet angle, nous donnons une interpretation differente de ce qui est appele, dans 
la littferature rfecente, universalite des equations d'Einstein. 

Nous analysons ensuite de fagon detaillee la formulation hamiltonienne des theories de la gravitation 
d'ordre superieur. Apres avoir rappele les ingredients necessaires a une telle construction, c'est-a-dire 
le formalisme de Dirac des systemes contraints et la methode d'Ostrogradsky generalisee, nous illus- 
trons I'efEcacite de cette derniere en construisant explicitement une formulation canonique des theories 
k lagrangiens non lineaires en la courbure. Considerant le lagrangien d'Einstein-Hilbert comme cas par- 
ticulier, nous demontrons le resultat majeur de cette partie, a savoir I'equivalence entre la formulation 
d'Ostrogradsky de la relativite generale et le celebre formalisme adm. Sous ce nouvel eclairage, nous dis- 
cutons ensuite le probleme des termes de surface pour les theories d'ordre superieur. Enfin, nous obtenons 
la forme explicite des contraintes et des equations canoniques provenant du lagrangien quadratique le plus 
general. 

La derniere partie du travail est consacree a I'etude de modeles spatialement homogenes dans le 
cadre des theories quadratiques de la gravitation. Tout d'abord, nous montrons que, au voisinage de la 
singularite initiale, le comportement oscillatoire et chaotique du modele anisotrope Bianchi IX base sur 
I'approximation dite de bkl est structurellement instable pour la theorie generale purement quadratique. 
Ensuite, nous particularisons le formalisme canonique d'Ostrogradsky aux modeles de Bianchi de classe 
A, en distinguant les deux variantes significatives de la thfeorie quadratique generale, a savoir, le cas oil 
le lagrangien se reduit au carre de la courbure scalaire et le cas conformement invariant, ou le lagrangien 
est egal au produit quadratique contracte du tenseur de Weyl. Dans la premiere variante, nous reduisons 
le systeme canonique de depart a un systeme d'equations differentielles autonomes du second-ordre que 
nous resolvons analytiquement pour Bianchi L Dans la seconde, pour ce mLme modele, nous demontrons 
que le systeme canonique n'est pas integrable, au sens de Painleve, nous dfeterminons toutes les solutions 
analytiques particulieres et discutons leur relation conforme avec des espaces d'Einstein. 
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Chapter 1 



Introduction 



"By Him who gave to our soul the Tetraktys, which hath 
the fountain and root of ever-springing nature. " 

— Pythagorean's oath. 

NOTWITHSTANDING the fact that Einstein's general relativity is experimen- 
tally tested with an overwhelmingly high degree of accuracy — from so- 
lar system tests to binary pulsars observational data — , it has become a 
peremptory necessity to consider alternative theories of gravity. The reasons can 
be summarised as follows. Firstly, although Einstein's theory is the simplest — and 
the most aesthetic — geometrical theory of gravitation settled on the basic postu- 
lates of the equivalence principle and general covariance of the physical laws, there 
are no a priori reasons whatsoever to restrict the gravitational Lagrangian to be a 
linear function of the scalar curvature nor to discard other more complicated frame- 
works obtained upon suitable generalisations of the Einstein-Hilbert Lagrangian 
of general relativity. Examples of such extensions are: scalar-tensor theories and 
higher-order gravity theories in four dimensions; Kaluza-Klein multidimensional 
theories; gauge theories of gravity, with torsion and 'non-metricity'. Secondly, a 
strong research effort has been produced so far in different directions in order to 
formulate a consistent quantum theory of gravity; although the electroweak and 
strong interactions are described by renormalisable quantum field theories, Ein- 
stein's gravitational theory cannot be quantised according to the standard schemes, 
for the Einstein-Hilbert action (with possibly a cosmological constant A) 

does not define a renormalisable quantum field theory. However, by adding to this 
action the most general, covariant action that contains quadratic curvature terms 



Chapter 1. Introduction 



and dimensionless couplings, namely 

5* = /d^x V^ {aR^ - PC^ + -fL^^ + SOR) , 

where C^ denotes the contracted quadratic product of the W'eyl tensor, Lqb the 
Gauss-Bonnet term (topological invariant), and the 'box' the d'Alembertian dif- 
ferential operator,^ one obtains a power-counting renormalisable theory [UD62],^ 
which is asymptotically free [FT81]. Secondly, one hopes that such alternative the- 
ories might provide one with a better approximation, semi-classical limit of a yet 
unknown quantum theory of gravity. Amongst the various attempts to understand 
what a quantum space-time really is, unifying schemes such as string theory, super- 
gravity, or more generally M-theory play a prominent role (see, e.g., [Rov98, Gib98] 
for very recent reviews); and it turns out that higher-derivative terms appear nat- 
urally in the low-energy effective Lagrangians of some of those theories. Thirdly, 
the standard model of relativistic cosmology suffers from a certain number of diffi- 
culties that could perhaps be more naturally resolved in the context of generalised 
theories. For instance, whereas the singularity theorems in general relativity show 
that the occurrence of space-time singularities is a generic feature of any cosmo- 
logical models (under some reasonable conditions), it might happen that in the 
context of alternative theories those unwanted singularities could be avoided; in 
fact, during the last decade the absence of cosmological singularities when higher- 
order curvature terms are taken into account has been pointed out in the literature 
several times (see, e.g., [KRT98] and references therein). 

As stated in the Preface, we do not intend to give an historical perspective of 
the development of higher-order theories of gravity. We just would like to mention 
that it can be traced back to the early years of general relativity, when great physi- 
cists like Weyl, Einstein, Bach, and others were undertaking the first investigations 
aiming at modifying the Hilbert variational principle so as to unify — on purely ge- 
ometrical grounds — electromagnetic and gravitational phenomena. Although this 
programme proved to be, so to speak, 'chimeric', it gave a renewed insight into 
the powerful use of the variational principle in gravitational theory and led many 
researchers to extend its domain of applicability to for instance other geometries 
or greater dimensions. Nowadays, the geometrical structure of any theory is in- 
timately connected with its formulation in terms of an action, in Lagrangian or 
Hamiltonian form, from which are derived the field equations by means of a spe- 
cific, properly defined variational principle. Because of the proliferation of various 



^For a list of the conventions and notations used throughout this dissertation, cf. page 6. 
^Rigorous renormalisability of the general fourth-order action has been proved by Stelle using 
BRS invariance [Ste77]. 



types of alternative gravity theories, it is of fundamental importance not just to 
confront their theoretical predictions with observational data but also to expressly 
understand their geometrical structure and the possible interconnections between 
their respective solution spaces. For instance, one question worth to be addressed 
in that respect is whether the conformal equivalence between a certain class of al- 
ternative theories and general relativity would hold in the context of metric-afiine 
variations of generalised actions in Weyl geometry; another is to analyse the Hamil- 
tonian formulation of those theories and possibly their quantisation. As regards 
higher-order theories though, it should be borne in mind that the field equations 
are much more intricate than Einstein's — any method of order reduction is thus 
most welcomed. 

Whereas plane wave solutions of linearised Einstein gravity propagate two phys- 
ical degrees of freedom, carrying helicity ±2, the most general quadratic gravita- 
tional Lagrangian has eight degrees of freedom: a massless spin-2 state, a massive 
spin-0 state, and a massive spin-2 ghost [Ste77, Ste78]. Higher-order gravity theo- 
ries have not received general acceptance as viable physical theories because some 
solutions of the classical theory are expected to have no lower energy bound and 
therefore exhibit instabilities, namely 'runaway solutions': the linearised field equa- 
tions propagate ghosts, i.e. negative-energy modes, and possibly tachyons.^ How- 
ever, crucial nonperturbative results have changed the bad reputation associated 
with this nonunitary character. For instance, the zero-energy theorem states that: 
Although the theory admits linearised solutions with negative energy all exact solu- 
tions representing isolated (asymptotically flat) systems have precisely zero energy; 
the solutions to the linearised equations with nonzero energy of either sign do not 
correspond to the limit of a one-parameter family of exact solutions [BHS83]. An 
interesting consequence of this theorem is that one cannot draw conclusions from 
the linearised theory concerning the stability of the full quantum theory. 

One special variant of the generic quadratic theory, which has attracted much 
interest as a promising candidate for quantum gravity, is called conformal grav- 
ity;"^ it is the purely quadratic theory based on an action containing only the 
Weyl-squared, C^ term, which possesses the aesthetically pleasing feature of being 
the local gauge theory of the conformal group and hence locally scale invariant. 
This means that the theory bears no resemblance to Einstein gravity classically; 
it encompasses six degrees of freedom corresponding to massless spin-2 and spin-1 
ordinary states and a massless spin-2 ghost state [LN82, Rie84]. Conformal gravity 
satisfies at least two remarkable properties [Rie86]: Birkhoff's theorem holds — in 



^This drawback of higher-order gravity pertains in fact to any theory with higher derivatives; 
see, e.g., [PU50]. 

■^For a detailed bibliography, the interested reader may consult [Que98]. 
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stark contrast to the generic quadratic case — and nonperturbative effects can con- 
fine the ghosts — analogously to the confinement of colour in quantum chromody- 
namics. The conformal fourth-order field equations, first put forth by Bach [Bac21], 
are found in differential geometry and in mathematical studies of Einstein's field 
equations of general relativity that use conformal techniques; they also constitute 
necessary conditions for a space to be conformal to an Einstein space. The classical 
study of the Bach equations is partly justified by the fact that conformal gravity 
has been viewed, during the last decade, as a possible, physically viable alternative 
theory of gravity that would be able to resolve some of the problems general rela- 
tivity alone is unable to address without ad hoc assumptions such as, for instance, 
the dark matter hypothesis [MK89]. However, a deeper investigation reveals other 
open problems that necessitate further consideration (see, e.g., [Kle98] for the most 
recent contribution). In particular, little work has been done in regard to spatially 
homogeneous cosmologies. 

Aside from conformal gravity, it is of great interest to study other variants 
of higher-order theories in the context of cosmology. The Friedmann-Lemaitre- 
Robertson-Walker (flrw) spaces, which are isotropic and are homogeneous on 
spacelike sections, constitute the basic pillar of the standard model of cosmology, 
which successfully accounts for many of the observed features of our universe. How- 
ever, this scheme is not free from certain riddles: the so-called 'horizon', 'flatness', 
and 'smoothness' problems. These are addressed, more or less adequately, by incor- 
porating the generally accepted inflationary paradigm into the orthodox isotropic 
cosmology with additional ingredients such as hot and cold dark matter blends — Is 
anybody still laughing at Ptolemaic epicycles? It is known that generalised gravity 
theories such as quadratic or scalar-tensor theories do possess solutions exhibit- 
ing an inflationary stage; in addition, they could possibly provide us with more 
satisfactory mechanisms to trigger inflation. On the other hand, mathematical 
cosmology focusses more on spatially homogeneous and anisotropic models since 
they are fairly more general than the FLRW universes and because one aims to in- 
vestigate specific issues, such as for instance the genericness of oscillatory, chaotic 
dynamical regimes on approach towards the initial singularity, unhindered by the 
stringent symmetry requirement that the universe be isotropic ab initio. 

In this dissertation we examine Lagrangian and Hamiltonian variational meth- 
ods for higher-order gravitational actions and in particular for nonlinear and con- 
formally invariant theories; we apply these methods to study the classical solution 
space of spatially homogeneous cosmological models in higher-order gravity. The 
outline is the following:^ 



'A more detailed plan is provided at the beginning of each of those chapters, i.e. on pp. 9, 49, 
and 125 respectively. 



In Chapter Two we begin by critically reviewing the variational principle 
in gravitational theory; we analyse the metric-afhne variational method in 
the context of generalised gravity theories in order to seek in which specific 
circumstances it can be utilised to deal with extended geometrical settings, 
such as Weyl geometry. We show that the Einstein-Palatini method exhibits 
inconsistencies and that it is a degenerate case of the constrained first-order 
formalism, which is the appropriate framework that can incorporate general 
Weyl spaces. We prove the chief result of this chapter, namely that the con- 
formal equivalence theorem of nonlinear theories with general relativity and 
additional scalar fields holds in the extended framework of Weyl geometry. 
As a direct consequence, we give a different interpretation of a universality 
property of Einstein's equations found in the context of the Einstein-Palatini 
method. 

In Chapter Three we focus on the Hamiltonian formulation of theories with 
higher derivatives. After a short summary of Dirac's formalism for con- 
strained systems and a detailed account of the generalised Ostrogradsky 
method, we develop a canonical formulation of nonlinear gravitational La- 
grangians of the type f{R)- Next, we prove the most important result of 
this chapter, namely that the Ostrogradsky Hamiltonian formulation of gen- 
eral relativity is equivalent to its well known ADM canonical version. We 
then discuss the issue of boundary terms in the light of the Ostrogradsky 
formalism and give the explicit forms of the super-Hamiltonian and super- 
momentum constraints, and of the canonical equations that are derived from 
generic quadratic Lagrangians. 

In Chapter Four we study spatially homogeneous Bianchi-type cosmologi- 
cal models in some variants of higher-order gravity. We firstly analyse the 
empty Bianchi-type IX or mixmaster model in the purely quadratic theory — 
without resorting to the Ostrogradsky method — on approach to the initial 
singularity; we prove that the mixmaster chaotic behaviour based on the 
BKL approximation scheme is structurally unstable and that there exists an 
isotropic power-law solution reaching the initial singularity in a stable, mono- 
tonic way. Next, we particularise the results obtained at the end of Chapter 
Three to class A Bianchi types in the two distinct variants of the generic 
quadratic theory, namely the pure 'i?-squared' case and the conformally in- 
variant, 'Weyl-squared' case respectively. In the former we reduce the system 
of canonical equations to a system of autonomous second-order coupled dif- 
ferential equations that we solve analytically for Bianchi type I. In the latter 
we prove that the Bianchi-type I system is not integrable — in the sense of 
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Painleve — ; we determine all particular solutions that may be written in 
closed analytical form and discuss their conformal relationship with Einstein 
spaces. 

Conventions and notations 

In this dissertation we adopt the sign conventions of the gravitation bible [MTW73]; 
in particular we use the metric signature ( — \- ++). We also make use of a 'cus- 
tomised version' of the abstract index notation discussed in [Wal84]: Latin indices 
of a tensor that belong to the beginning of the alphabet a,b,c, . . . denote the type 
of the tensor — not its components — ; Latin indices of a tensor that belong to the 
middle of the alphabet i,j, k, . . . refer to spacelike components in the Cauchy hy- 
persurfaces Sj defined by the slicing of space-time; however, in Chapter 2 we do 
not employ Greek indices a, /3, 7, . . . to refer to the components of a tensor with 
respect to a specific coordinate space-time basis: we keep Latin indices a,b,c, . . . 
instead. 

The symbol Va usually stands for the covariant derivative operator but oc- 
casionally denotes the associated linear afhne connection. In Chapter 2 we also 

o 

employ the symbol Va to refer to the covariant derivative operator associated with 
the Levi-Civita connection, the components of which in a non-coordinate basis are 

where {J«} are the Christoffel symbols, i.e. the components of the Levi-Civita 
connection in a natural basis, and the C's are the structure coefficients of the 
non-coordinate basis. 

The Riemann curvature tensor is defined by 

Rcdb^Va = -R\cdVa = (VeVrf - V d^ c)vb, 

for arbitrary vectors u and one-forms Va, where Va is the covariant derivative 
operator. The Ricci tensor is obtained by contraction on the first and third indices, 
that is 

Rab = R acb- 

Gothic characters denote tensor densities; e.g., 21'*'' := ^/—gA"''^; round and 
square brackets around indices denote respectively symmetrisation and antisym- 
metrisation (including division by the number of permutations of the indices); a 
tilde denotes conformally transformed quantities. 
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Unless otherwise stated we use geometrised units, where the Newton gravi- 
tational constant G^ and the speed of light in vacuum c are set equal to one: 
G^ = l = c. 

We adopt Schouten's nomenclature for the type of spaces considered [Sch54]. 
An Ln is a general n-dimensional manifold endowed with a linear connection; when 
the latter is symmetric the L„ is called an An and is torsion-free. When a metric 
tensor gab is defined, the compatibility condition does not hold in general, i.e. 
"^^cdab = —Qcab 7^ 0. If Qcab = 0, the Connection is called metric with respect to 
gab and the L„ is called a [/„; if in addition the connection is symmetric, one has 
a Vn, i.e. Riemann space. If Qcab = QcSab, the connection is called semi-metric; if 
in addition it is symmetric, the L„ is called a Wn, i.e. Weyl space. 
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Chapter 2 

Variational and conformal 
structure in higher-order gravity 



"Sub fide vel spe geometricantis naturos." 

— Giordano Bruno. 

VARIATIONAL PRINCIPLES play a prominent role in theoretical physics; it 
has become well accepted during this century that any fundamental phys- 
ical theory can be formulated in terms of an action, in Lagrangian or 
Hamiltonian form, from which are derived the equations of motion by means of 
a variational principle. For the last decades this has been raised to a metalaw of 
nature: Nowadays, setting up a (field) theory means that one starts to write an 
action in terms of the fields one considers, even if the knowledge of the actual form 
of their interactions and symmetry properties is fragmentary. Specification of the 
Lagrangian function is determined by mathematical and physical requirements like 
gauge invariance, renormalisability, simplicity, and so forth. Yet, certain peculiar- 
ities of the Lagrangian that arise under symmetry transformations, such as the 
appearance of a total divergence, might indicate "that one is dealing with some 
approximation or a limiting case of a 'better' theory, in which the corresponding, 
possibly modified, symmetries fully preserve the action integral" [Tra96]. 

Even though we do not intend to discuss epistemological or metaphysical issues 
related to the significance of the variational method we would like to emphasise 
that, historically, it has often been the focus of philosophical contentions and mis- 
conceptions for, in contradiction to the usual causal description of phenomena, "the 
idea of enlarging reality by including 'tentative' possibilities and then selecting one 
of these by the condition that it minimizes a certain quantity, seems to bring a 
purpose to the flow of natural events" [Lan49, p. xxiii]. Still, one ought not be 
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disconcerted: For the universal mind of the seventeenth and eighteenth centuries, 
the two ways of thinking did not necessarily appear contradictory. Leibniz who had 
a strong influence on the development of the variational method — for example, the 
present use of the word 'action' in physics probably originates from Leibniz's expres- 
sion actio formalis — had strong teleological propensities, which also characterised 
the ideas of Fermat, Borelli, and Maupertuis. By contrast, the sober, matter-of- 
fact nineteenth century — which still gets hold of numerous present-day scientists — 
looked at the variational principles (of mechanics) merely as convenient alternative 
mathematical formulations of the fundamental laws, without any primary impor- 
tance whatsoever. This pragmatic point of view has however changed with the 
advent of general relativity — in the light of which "the application of the calcu- 
lus of variations to the laws of nature assumes more than accidental significance" 
[Lan49] — and quantum mechanics — especially with regard to Feynman's 'sum-over- 
histories' approach, in relation with Dirac's deep intuition [FH65, MTW73].^ 

As stated in the Introduction, we aim to analyse the metric-affine variational 
method as applied to gravity theories in order to determine in which specific cir- 
cumstances it can be regarded as a possible generalisation of the standard metric 
variational principle of general relativity. First of all, we make a brief historical 
survey to see how the metric-affine variational principle was introduced in general 
relativity.^ 

The purely metric or Hilbert variational principle of general relativity was prop- 
erly defined during the years 1914 to 1916 owing to the works of Einstein, Hilbert, 
and Lorentz for any of whom, at that time, the metric tensor was thought of as the 
only fundamental gravitational field. However, from the early works of Levi-Civita 
and Hessenberg in 1917, Weyl and Cartan developed, until 1923, the new con- 
cept of affine connection on manifolds without a metric structure. For many years 
Weyl's theory of symmetric linear connections [Wey50a] has been a rich source of 
inspiration for himself and physicists like Eddington and Einstein whose aim was 
to unify — on purely geometrical grounds — gravitational and electromagnetic phe- 
nomena by means of an affine variational principle. Unfortunately their attempts 
did not meet their hopes and Einstein abandoned the purely affine theory on be- 
half of a metric-affine variational method in which the metric tensor and the affine 
connection are considered as independent fields through the process of variation; 
Einstein proved exactly, for the first time, what is placed in most modern textbooks 
under the authorship of Palatini — even though Palatini's contribution (cf. equation 



^In particular, we now know that the value of the action integral — and not only its variation — 
is physically relevant. 

^For a more detailed account, we refer the interested reader to [FFR82, Viz89, Que98], and 
references therein. 
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(2.7)) was formulated in a purely metric framework [Pall9, Pau58] — , namely the 
equivalence of Einstein's field equations of vacuum general relativity and the field 
equations that are derived by means of metric-afiine independent variation of the 
Einstein-Hilbert gravitational action [Ein25]. Once again, Einstein's attempt of a 
geometric unification failed and the metric-affine method lost its interest — even if 
Schrodinger revisited the question twenty years later [SchSO] — until the late fifties 
with the works of Stephenson [Ste58, Ste59] and Higgs [Hig59] who analysed the 
field equations obtained from quadratic Lagrangians via a metric-afiine variational 
principle; more specifically, they considered those equations as an alternative set 
to Einstein's field equations: The choice of Lagrangians — quadratic in the various 
curvature tensors — was again motivated by Weyl's unified theory of gravitation 
and electromagnetism.^ However, Buchdahl raised severe conceptual objections in 
regard to the self-consistency of the method and proposed implicitly to abandon 
the use of 'Palatini's device' in gravitational theory [Buc60]. 

At the same time the so-called 'Hilbert-Palatini action principle' together 
with the 'three-plus-one decomposition' of space-time was invoked successfully by 
Arnowitt, Deser, and Misner (adm) in order to develop a Hamiltonian formulation 
of general relativity [ADM62, MTW73, §21.2 and §21.7].'' However, referring to the 
'Palatini variational principle' is misleading. As a matter of fact, there is no need 
whatsoever to resort to a metric-affine variation in order to rewrite the Einstein- 
Hilbert action in canonical form.'^ In that respect, the analogy (cf. [MTW73, 
§21.2]) between the 'Palatini variation' and Hamilton's principle in phase space is 
inaccurate — stricto sensu it is wrong. ^ In fact, the conjugate momenta are defined 
in terms of the extrinsic curvature, not in terms of the connection; the equivalence 
of purely metric and metric-affine variations in vacuum general relativity is a mere 
coincidence (cf. Subsection 2.1.2). 

In the same spirit as in Stephenson's articles, Yang investigated a theory based 
on a Lagrangian that is quadratic in the Riemann tensor, by analogy with the Yang- 



^There were numerous attempts towards that goal, which were chiefly characterised by a 
modification of the variational principle through difi^erent kinds of alteration of its underlined 
geometric structure: semi-Riemannian manifolds in dimensions greater than four (e.g., Kaluza- 
Klein theories); nonmetric connections (e.g., Weyl geometry, theories with torsion); purely affine 
connections (e.g., Einstein-Schrodinger theory); and so forth. For an exhaustive study on all 
these alternative theories, we refer to the remarkable treatise of Tonnelat [Ton65]. 

■^See Subsection 3.1.1 on page 50 and Subsection 3.3.1 on page 89. 

'By contrast, the Ashtekar canonical formalism uses explicitly the Hilbert-Palatini first-order 
variational method, with the tetrad field and spin-connection as independent variables, and where 
only the self-dual part of the curvature is retained in the Lagrangian; see, e.g., [Pel94]. 

^This was clearly emphasised by El-Kholy, Sexl, and Urbantke who distinguished between 
what they called the 'Palatini principle' and the 'formal Palatini method of variation'; the adm 
procedure belonging to the second class [ESU73]. 
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Mills Lagrangian [Yan74]. Unfortunately, Stephenson's and Yang's field equations 
are tainted by a conceptual mistake occurring in the process of variation. Before 
this error was noticed, several authors proved that those equations were leading to 
generic unphysical solutions and consequently they ruled them out 7 

Extending previous results — only valid in vacuum general relativity — of Lanc- 
zos [Lan57] and Ray [Ray 75], Safko and Elston applied the metric-afhne varia- 
tional principle with Lagrange multipliers to quadratic Lagrangians [SE76].^ Un- 
acquainted with the Ostrogradsky method, they also tried to establish a connec- 
tion between the Lagrange-multiplier version of the variational principle and the 
ADM formalism in order to develop a Hamiltonian formulation of quadratic gravity 
theories (cf. Subsection 3.3.4). Independently, Kopczyhski showed that the intro- 
duction of appropriate constraints into the gravitational action may serve to 'unify' 
the variational derivations of distinct theories of gravity such as Einstein's theory 
and the Einstein-Cartan theory [Kop75]. The most recent generalisations of this 
constrained method of variation, for manifolds with torsion and 'non-metricity', 
lead to the metric-affine gauge theory of gravity (mag) [HMMN95]. 

Following Buchdahl [Buc79], we address the questions of the utility and con- 
sistency of this method of variation in the broader context of generalised grav- 
ity theories.^ More specifically, in Section 2.1 we briefly present the well-known 
Hilbert method of variation, mainly to settle our notations, and critically review 
the Einstein-Palatini variational principle in general relativity. In Section 2.2 we 
extend the study of the variational principle to the domain of higher-order La- 
grangians; the ensuing picture reveals that the Einstein-Palatini variational prin- 
ciple is generically unreliable, already at the classical level. This conclusion conveys 
us to carefully formulate a Lagrange-multiplier version of the metric-affine varia- 
tional method that we call constrained first- order formalism. As a first application, 
we consider the variation of several higher-order Lagrangians with a Riemannian 
constraint and correct Safko and Elston's results [SE76]. Furthermore, we show 
that the equivalence of the field equations that are derived from appropriate actions 
via this formalism to those produced by variation of purely metric Lagrangians is 
not merely formal but is implied by the diffeomorphism covariant property of the 
associated Lagrangians. In Section 2.3, after a brief account of the conformal rela- 
tionship between nonlinear and scalar-tensor gravitational Lagrangians and general 
relativity with additional scalar fields, we analyse the conformal structure of non- 



'^Notwithstanding this fact, a detailed study of Yang's equations has been published most 
recently [GN98]. 

*In spite of several misprints in the resulting formulae Safko and Elston's conclusions are right; 
cf. Subsection 2.2.3. 

^We restrict ourselves to geometries without torsion, i.e. symmetric connections. 
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linear gravity theories in the context of the constrained first-order formalism; in 
particular, we prove that the conformal equivalence theorem of those theories with 
general relativity plus a scalar field holds in the extended framework of Weyl geome- 
try. This investigation enables us to give a different interpretation of what has been 
recently called a 'universality property of Einstein's equations' [FFV94, BFFV98] 
and to invalidate a recent claim on a possible explanation of a — controversial — 
observed anisotropy in the universe [TU98, QMC99]. Finally, we point out how 
these results may be further exploited and address a number of new issues that 
arise from this analysis. This work was carried out in collaboration with S. Cotsakis 
and J. Miritzis [CMQ97] (see also [Mir97, Cot97, Que97]). 

2.1 Variational principles in general relativity 

2.1.1 Hilbert variation 

Consider a four- dimensional space-time manifold M endowed with a Lorentzian 
metric gab and assume that the connection Vc be the symmetric Levi-Civita con- 
nection, i.e. VcQab = 0; hence (M, g, V) is a V4, i.e. a Riemannian space. In the 
Lagrangian formulation of the theory the Hilbert metric variational principle pro- 
ceeds with the specification of a Lagrangian density £, which is assumed to be a 
functional of the metric and its first and possibly higher derivatives, that is 

£ = £(g,9g,92g,...). (2.1) 

In addition, one requires that £ be a scalar density of weight -|-1, i.e. £ = ^/^L, 
where g denotes the determinant of the matrix formed with the components of gab 
and L is the Lagrangian; this enables one to form the action integral 

S[g]= [d^n£, (2.2) 

Ju 

where d'^Q, = dx^ A dx^ A dx^ A dx^, which is taken over a compact region It of the 
manifold M. The field equations are obtained by requiring that the action (2.2) be 
stationary under arbitrary variations such that the metric and its first derivatives 
be held fixed on the boundary dU. This variation defines the functional derivative 
2ab of the Lagrangian density £, viz. 

d''n£abSg''\ with £,fe := ^-^, 

also called the Euler-Lagrange derivative of £, and the field equations are 

£a6 = 0. 
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As is well known, the variational principle implies very important differential 
constraints on the field equations, which hold 'off shell', i.e. whether or not the field 
equations are satisfied; these are the generalised Bianchi identities, obtained from 
Noether's second theorem by taking as a specific class of variations of the metric 
that induced by diffeomorphisms / : M ^ M. Since the manifolds (M, g) and 
(M, /*g) are physically equivalent, the action functional does not change under 
the diffeomorphism /; in particular, it remains unaltered under an infinitesimal 
coordinate transformation. For such variations, it is not difficult to see that, at the 
first order of perturbation, 5g is given in terms of the Lie derivative of the metric 
with respect to the vector field v'^ that generates the diffeomorphism /, that is,^° 

Since by definition Hab is a symmetric density of weight +1, the variational principle 
yields 



6S[g] = -2 /d40£a6(vV) =0 
Ju 



for all vector fields v'^ that vanish on the boundary. Integrating by parts the last 
equation and dropping the divergence term one obtains the expected generalised 
Bianchi identities, namely 

V"£a6 = 0. (2.3) 

The simplest Lagrangian density for gravity is the Einstein-Hilbert Lagrangian 
density £eh = yf—gR, where R = QabR"'^ is the scalar curvature, to which one may 
possibly add a cosmological constant term A.yJ—g. The corresponding gravitational 
action is 



c 



2k 



S=— d'^nZ^,, (2.4) 



u 



where k^ = SttG^c'"^ is the Einstein gravitational constant. ^^ The Ricci tensor Rab 
is expressed in terms of the connection coefficients F*^^^ and their first derivatives, 
viz. 

Rab '■= R acb = ^cf ^b ~ ^b^ ac'^'^ cd^ ab ~ ^ bd^ ac- 



i°See [Wal84, Appendix C]. 

^^Hereafter we use 'geometrised units', where the Newton gravitational constant Gn and the 
speed of light in vacuum c are set equal to one, and we rescale the coordinates to absorb the 
constant factor Sn. 
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Since the 'metricity' or compatibility condition holds, i.e. Vg = 0, the F's are the 
Christoffel symbols, namely 

r'ab = Ub} = If" {dbgad + daQdh " ddQah) ■ (2.5) 

As the Einstein-Hilbert Lagrangian density depends linearly on the second-order 
derivatives of the metric, one could discard these higher derivatives through a 
total divergence, the variation of which would not affect the equations of motion. 
Hence, one could start with the so-called 'gamma-gamma' first-order form of the 
Lagrangian density for gravity, which is given explicitly by 

£ = y^5"' {t\,T\^ - T'^.cKb) ■ (2.6) 



This possibility elucidates why Einstein's field equations are second-order instead 
of fourth-order differential equations. However, one should bear in mind that the 
'gamma-gamma' Lagrangian density is no longer a scalar density. 

Nevertheless, one aims at deriving Einstein's equations in vacuum by requiring 
that the action (2.4) be stationary under arbitrary variations of the metric that 
vanish on the boundary. This is achieved with the help of the formula 5y/^ = 
— 2\/~9 9ah^9"' and the Palatini equation 

5R\,d = V,{6T\^) - Vd{6T\,), (2.7) 

which can be easily derived in a locally geodesic coordinate system (see, e.g., [d'I92]) 
and the contraction of which is 

5Rab = V,{5T\,)-V,{5T\,). (2.8) 

The Hilbert metric variation of the action (2.4) is first written as^^ 

6S= j d^O (SQ^^^'Rab + Q'^^^Ral) , 

Making use of equation (2.8) and owing to the compatibility condition (in the form 
VcS"^ = 0) one can transform the second term of the integrand as a pure divergence; 
by Gauss's theorem the corresponding integral becomes a surface integral over the 
boundary dXl and vanishes because the variations are assumed to vanish on the 
boundary. ^^ Hence the variation of the Einstein-Hilbert action reduces to 

5S= /d^fi 0,6(55"' = 0, 
Ju 



^^We recall that 'Gothicised' quantities denote tensor densities; for instance, 21" := ^/—gA"'^, 
for an arbitrary tensor field A" . 

^^In fact, for general variations such that only the metric be held fixed on the boundary, this 
surface integral does not vanish; cf. our subsequent discussion on page 94 on the role of boundary 
terms. 
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where (5ab denotes the Einstein tensor density associated with the Einstein tensor 

Gab '■= Rab — -^Rgab- 

Since the variations Sg""^ and the region of integration XL are arbitrary, one concludes 
that the variational principle for the action (2.4) implies Einstein's vacuum field 
equations. 

In keeping with the above variational principle, when matter comes into play, 
one must add to the gravitational Lagrangian density (2.1) an appropriate La- 
grangian density £„ for the corresponding fields, which assumes a form that is a 
'generalisation' of its special relativistic form — which depends primarily on the field 
variables, collectively called V — , achieved via the strong principle of equivalence 
according to the 'minimal coupling' rule:^^ 

riab > gab, da > Vq. 

Observe that the order of the two steps is irrelevant as long as the connection is 
the Levi-Civita connection: For arbitrary connections the operation of lowering 
and raising indices does no longer commute with the operation of covariant differ- 
entiation. The total action is defined as /(£ + £„) and variation of the second term 
with respect to the metric defines the stress-energy tensor Tab (cf. [LL89, §95]) so 
that the full field equations are Gab = Tab supplemented by the equations of mo- 
tion for the fields jp. Furthermore, the generalised Bianchi identities (2.3) take the 
form of the contracted Bianchi identities, i.e. V^Gab = 0, which in turn entail the 
covariant conservation of the stress-energy tensor, as a direct consequence of the 
invariance of the Einstein-Hilbert action under diffeomorphisms. 

2.1.2 Einstein— Palatini variation 

Consider a four-dimensional space-time manifold M endowed with a Lorentzian 
metric Qab but now assume that the connection Vc be an arbitrary symmetric 
connection, i.e. V cOab / and zero torsion; hence (M, g, V) is an ^4, i.e. an affine 
space provided with a symmetric linear connection (in which here a metric tensor is 
also defined), which is not necessarily a V4: In particular, no relationship is assumed 
a priori between the metric and the connection, i.e. they are independent from 
each other. The Einstein-Palatini, metric-affine variational principle proceeds with 
the specification of a Lagrangian density £ that is constructed from the Riemann 



^■^This prescription is not free from a certain ambiguity in the sense that there are other 
generalisations — e.g., conformal coupling — of the aforementioned special relativistic form that 
are indeed compatible with the basic postulate of general covariance. 
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tensor of the connection and also the metric; which is therefore assumed to be 
a functional of the metric, its covariant derivatives up to a certain order, the 
connection coefficients, and their derivatives up to a certain order, that is (formally) 

£ = £(g, Vg, VVg, ...■T,dT, d'T, ...). (2.9) 

The analogue of the action functional (2.2) is 

S[g,T] = /d^flXi. (2.10) 

Ju 

Its variation under arbitrary independent variations of the metric and the connec- 
tion that vanish on the boundary 5 U is given by 



Ju ^ 



where 21^"'' and !Bab are the Euler-Lagrange derivatives of £ with respect to the 
connection and the metric respectively; hence the field equations are 

a/' = 0, ^ab = 0, 

sometimes called T- and g-equations respectively. 

As was first noted by Buchdahl, invoking such a method of variation in a 
gravitational theory is highly objectionable, for the Einstein-Palatini prescriptions 
ascribe to the variational principle the cumbersome task of picking out a specific 
class of spaces amongst all possible A^s [Buc60]: It is therefore implicitly assumed 
that one is dealing with a much broader geometrical setting than the familiar 
Riemannian space of general relativity; in fact, depending on the specific Einstein- 
Palatini Lagrangian, the general A^ could degenerate into a more specialised space 
as for instance a Weyl space W4, or it could remain totally unspecified, with a 
completely arbitrary metric tensor. In our point of view such arbitrariness in the 
variational principle is obnoxious; we recommend that the specific geometry one 
is dealing with be fixed ab initio even though this is not the usual Riemannian 
geometry (cf. Subsection 2.2.3). 

In the case of general relativity £ can be chosen as the Hilbert-Palatini La- 
grangian density £hp = q"''' Rab(^ , dT) , where the Ricci tensor depends on the 
connection coefficients and their first derivatives only; hence £„? is regarded as a 
functional of the 10 metric components and the 40 connection coefficients. It turns 
out that the Einstein-Palatini method of variation is technically simpler than the 
Hilbert method of variation described in the previous subsection. Varying the 
Hilbert-Palatini action with respect to the metric 5"^ one obtains directly 

5S= [ d^n RabdQ'''' = [ d^Q ^abSg""' = 0, (2.11) 

Ju Ju 
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whereas variation with respect to the afRne connection F"^^^ yields, by virtue of the 
contracted Palatini equation (2.8), 

JU JU L 

Integrating by parts and discarding the divergence term by the usual argument one 
obtains 



5S = I d^n 
'u 



Vb9'''ST'',,-V,Q''HT' 



lU ^ ' 

Since the variations ^F'^^^, symmetric in a and 6, and the region of integration It 
are arbitrary, the symmetric part of the expression in round brackets must vanish, 
i.e. 

5(Vd0")'^-V,0'^^ = O. (2.12) 

This latter equation is equivalent to the metricity condition Vcfl"^ = = Vc^"*; 
hence the connection coefficients F'^^^^ are necessarily the Christoffel symbols \^'^^- 
Therefore one deduces that the field equations obtained from equation (2.11) co- 
incide exactly with Einstein's vacuum field equations. This fact is the source of 
the commonly accepted belief that the Einstein-Palatini variational principle is 
equivalent to the Einstein-Hilbert variational principle. However, as we shall ex- 
emplify below, this is erroneous. As a matter of fact, for the Lagrangian L = R 
in vacuum, ^^ the equivalence of the field equations turns out to be a mere coin- 
cidence. Still, in that very case the two variational methods are not equivalent 
because the corresponding boundary conditions are different. As mentioned above 
(cf. footnote (13)), in the purely metric situation the boundary term occurring in 
the process of variation does not vanish in general since the metric only is held 
fixed on the boundary. By contrast, in the metric-affine case the boundary term 
comes to naught since, in addition to the metric, the connection also is held fixed 
on the boundary. This means that one should have to add an ad hoc surface term 
in order to recover the Hamiltonian description of the fields at spatial infinity in 
the case of asymptotically fiat space-times (cf. the discussion on boundary terms 
on page 94).^^ On the other hand, inasmuch as quantum gravity is concerned, the 

^'Observe that the 'gamma-ganinia' Lagrangian density is here ruled out as an alternative start- 
ing point for the derivation of Einstein's field equations because the difference between (2.6) and 
the Einstein-Hilbert Lagrangian density is a pure divergence only if the compatibility condition 
is assumed ah initio [ESU73]. 

^^This undesirable feature is also present in the Ashtekar formalism, as a direct consequence of 
the use of a Hilbert-Palatini Lagrangian; see, e.g., [Bom88]. 
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Einstein-Hilbert and Hilbert-Palatini Lagrangians — which are only equivalent 'on 
shell' — will most probably give different theories since in the path-integral approach 
for instance one sums over all 'off-shell' contributions to the action. Furthermore, 
the choice of £hp as the starting Lagrangian density for the Einstein-Palatini vari- 
ational principle is very peculiar: The most general second-order (torsion-free) 
metric-affine Lagrangian density does in fact involve additional terms of the form 
(Vg) • (Vg) (such as, e.g., ^ ayJ—g^bO"'^)- In other words there are no selection 
rules that enable to pick out the Hilbert-Palatini Lagrangian density, by contrast 
to what happens in the metric case, where the requirement of having second- 
order field equations together with the covariance property uniquely determines 
the Einstein-Hilbert Lagrangian density (up to a cosmological constant). In an 
interesting paper Burton and Mann have recently found, for the aforementioned 
general second-order metric-afhne Lagrangian density, the conditions under which 
the connection is in fact the Levi-Civita connection associated with the metric 
[BM98a]. They have shown that the compatibility condition, which arises natu- 
rally in the Hilbert-Palatini case, is in the more general situation a constraint that 
is induced by the breaking of a symmetry of the connection coefficients under the 
'deformation transformation' F'^^jj — > F'^^^ -|- C"^^j, (where C^^b i^ ^^ arbitrary ten- 
sor field symmetric in a and h). On the other hand, in the maximally symmetric 
case — i.e. when no constraints are imposed on the tensor field C^ah — the connec- 
tion remains completely undetermined but can always be chosen so as to recover 
Einstein's field equations. ^^ 

In the presence of matter fields there is an ambiguity in the prescription of the 
minimal coupling rule because the compatibility condition between the metric and 
the connection does not hold. Moreover, variation of the total action 

S= /d4or9^(g,r)+£M(g,^,VV)l, (2.13) 

gives the following pair of equations: 






<^ab = Zab := -2ti^, (2.14a) 



<5Ndr^'-Ve0'^^ = ||^, (2.14b) 

"-*- ab 

which are inconsistent in general owing to the fact that the purely geometric parts 
of these equations are projectively invariant whereas typical sources are not [HK78]. 
The equations (2.14) would be equivalent to the full Einstein field equations ob- 
tained via the Einstein-Hilbert variation only if the matter Lagrangian did not 



'^Curiously, the special choice T'^at = also turns the g-equations into Einstein's field equations. 
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depend explicitly on the connection, i.e. dS.j^/dT'^^f^ = 0. In most circumstances 
this will be the case (e.g., scalar, Yang-Mills, electromagnetic fields) but one can 
pick out examples where this is not. For instance, when Einstein-Dirac fields are 
involved, the connection directly couples to the gravitational field thereby breaking 
the equivalence.^^ 

Remark. In two dimensions in vacuum the Einstein-Palatini variation is unable to 
determine the connection completely [Des95]. This is can be seen as follows. For 
any dimension n, the equation (2.12) implies that the connection coefficients are 
given by the formula 

Kt = {;j + li^aQb + StQa - 9abQ'), (2.15) 

where we have defined Qa '■= — Va(ln-^— (7) = —5a (In \/—g) + F^ with F^ := F^^. 
The trace of equation (2.15) is 

ft \ 

l--)[9a(lny^)-Fj=0 

and identically vanishes when n = 2. Hence the F^ part of the connection is 
undetermined in two dimensions. 

For completeness we reiterate that the Ashtekar formulation of canonical gravity 
is grounded on a generalised metric-affine variational principle, often referred to as 
Hilbert-Palatini variation [JS88, Pel94]. To be more specific, the basic independent 
(complex) fields are expressed in terms of soldering forms and self-dual connections 
instead of metrics and affine connections respectively [Ash86, Ash87]. (Those new 
canonical variables can be obtained by a succession of canonical transformations 
from the canonical variables of tetrad gravity [HNS89].) The Lagrangian for pure 
(complex) gravity is the so-called self-dual Hilbert-Palatini Lagrangian and the 
Ashtekar Hamiltonian is obtained upon an appropriate Legendre transformation.^^ 
In Ashtekar's formalism classical general relativity stems from the real sector of 
the complex theory. As a nonperturbative approach to canonical quantum gravity 
the Ashtekar Hamiltonian formulation does not involve generalised — e.g., higher- 
order — Lagrangians; therefore the only circumstances where the equivalence men- 
tioned above breaks down arises when Dirac spinor fields are considered. In that 
case, because the spinors couple directly to the spin connection, the compatibility 
condition is altered by a term that gives torsion. Nevertheless, this problem can 
be avoided if an ad hoc term is added to the original Lagrangian [ART89]. 



^*It can however be restored by 'healing' the action with ad hoc compensating terms [WeySOb]. 
^^This formulation uses explicitly the aforementioned remarkable equivalence property of the 
Einstein-Hilbert and Einstein-Palatini variations. 
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2.2 Variational principles in higher-order gravity 

2.2.1 Metric variation 

In this subsection we write down the Euler-Lagrange derivatives stemming — via 
purely metric variation — from higher-order Lagrangians that are quadratic in the 
curvature tensors and from nonhnear gravitational Lagrangians. ^° Our main pur- 
pose is to provide the subsequent investigation of Chapter 4 with the explicit form 
of the field equations in vacuum and to enable a direct comparison with the equa- 
tions that will be obtained in the next subsection via the metric-afhne variational 
principle. In Subsection 2.2.3 we will show how the derivation of the fourth-order 
field equations can be overwhelmingly simplified by means the constrained first- 
order formalism. 

In a four- dimensional Riemannian space-time (M, g) the general gravitational 
action that contains, besides the Einstein-Hilbert term £eh = V—gR (and possibly 
a cosmological constant) of general relativity, quadratic invariant combinations of 
the Riemann curvature tensor, Ricci tensor, and scalar curvature is 



S 



d^n 



M 



£eh + Ti-S^i + 72-^2 + 73-C3 



(2.16) 



where 7^ for i = 1,...,3 are coupling constants and the quadratic Lagrangian 
densities are explicitly given by 



£1 
£2 
£3 



-gR^ 

-gR'^ Rab, 

— nabcd r? 
-gn rCabcd- 



(2.17a) 
(2.17b) 
(2.17c) 



abed Tjfi/ 



^abcd 



IS 



(Note that a fourth possibility exists, namely £4 := e R abR^fcd, where e 
the Levi-Civita tensor, but it is of no interest according to parity conservation.) 

The Lanczos action, which is constructed from the Gauss-Bonnet quadratic 
combination R"'^'^'^ Rabcd — ^R"'^Rab + R^ , becomes a topological invariant in four 
dimensions: the Euler-Poincare characteristic of the manifold (Gauss-Bonnet the- 
orem). The corresponding Euler-Lagrange derivative is the Bach-Lanczos identity 
[Bac21, Lan38]: 



lab 



-9 



r^acde f-ib 



cde 



^g^'^'CrArfC^'"'^ 



^cdef'- 



(2.15 



Owing to this remarkable property, only two of the quadratic invariants £j are 
independent in four dimensions. 

^°For a very detailed derivation of these results in the case of quadratic Lagrangians, we refer 
to Simon's undergraduate thesis [Sim88]. 
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Another interesting quadratic invariant is that constructed from the Weyl ten- 
sor, namely 



-g CabcdC' 



abed 



(2.19) 



The associated action is the conformally invariant action in four dimensions, which 
is uniquely determined upon gauging conformal symmetry, i.e. upon promoting the 
global conformal group to a local gauge symmetry [KTN77, Rie86]. 

The fourth-order Euler-Lagrange derivatives corresponding to (2.17) are re- 
spectively:^^ 



£f 



£f 



(\ab 
-^3 



1 



,ab Ty2 



ja-r-jbi 



^Clbr 



-g^^R^ - 2RR"' + IVVR - 2g'"'DR 
^-g<'^R^^R"^ - 2R^'""^R^d + VV^R - Di?"^ - -g^'^OR 



gabj^cdefj^^ 



cdef 



r) Tycdeb t> cl 



yab 



AnR""" + IVVR 



aY-ib] 



jbcad 



AR'^'^'R^d + AR'^'R" 



(2.20a) 
(2.20b) 

(2.20c) 



where the box D is the d'Alembertian second-order differential operator. The 
Euler-Lagrange derivative stemming from (2.19) is the tensor density associated 
with the Bach tensor Bab [Bac21], which is defined by 



Bab = 2V'^V Ccabd + CcabdR^ 

/ ?? \ 1 

= -U(Rab - -gab) + gVaVfoi? + {Ccabd + Rcabd + Rcb9ad)R'"^ 



(2.21) 



and is symmetric, trace- free, and conformally invariant of weight —1. 

Aside from the quadratic invariants above, we also consider a nonlinear La- 
grangian density that is a smooth arbitrary function of the scalar curvature, namely 



-gf{R), with/'Vo, 



(2.22) 



where a prime denotes differentiation with respect to the scalar curvature. The 
corresponding Euler-Lagrange derivative is 



f'Rab - ^fgab - "^a^bf + Qab^f = 



(2.23) 



and was first obtained by Buchdahl [Buc70]. 

^^This can be checked with the MathTensor package for Mathematica; see, e.g., [TPH96] 
for the conformally invariant case. 
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2.2.2 Metric-afRne variation 

As already mentioned in the introduction of this chapter, such alternative varia- 
tional principles as the purely affine or the metric-afhne methods of variation were 
put forth in the early years of the relativistic gravitational theory. They were first 
analysed in the framework of quadratic gravitational Lagrangians by Weyl [Wey50a] 
and Eddington [Edd23]. Later, with the aim of obtaining second-order field equa- 
tions different from Einstein's, Stephenson [Ste58, Ste59] and Higgs [Hig59] applied 
the Einstein-Palatini variational principle to the quadratic Lagrangians densities 
(2.17) and Yang investigated a theory based on the Lagrangian density (2.17c), by 
analogy with the Yang-Mills Lagrangian [Yan74]. However, as Buchdahl pointed 
out [Buc60], there was an error in Stephenson's method of variation — present in 
Yang's as well — as he imposed the metricity condition, i.e. the Levi-Civita con- 
nection, after the metric-affine variation. Buchdahl subsequently gave the correct 
field equations and demonstrated by means of specific examples that the Einstein- 
Palatini variational principle is not reliable in general [Buc79].^^ 

Recent investigations on those field equations derived form higher-order gravita- 
tional Lagrangians via the Einstein-Palatini variational principle may be reviewed 
briefly. The nonlinear f{R) case has been studied by several authors. In a se- 
ries of papers Shahid-Saless analysed the theory based on a i? + 71 i?^ Lagrangian 
with matter [Sha87, Sha90, Sha91]. This was generalised to the f{R) case by 
Hamity and Barraco who also studied the conservation laws and weak field limit of 
the resulting equations [HB93]. Ferraris, Francaviglia, and Volovich have recently 
shown that the Einstein-Palatini first-order formalism applied to general nonlinear 
/(i?) Lagrangians leads to a series of Einstein spaces, the cosmological constants 
of which being determined by the specific form of the function / [FFV94]. Similar 
results have been obtained in the case of f{RabR"'^) Lagrangians by Borowiec et al. 
[BFFV98, BF98]. First-order variations of a generic set of higher-order curvature 
terms appearing in string effective actions have been studied within the context of 
the cosmological constant problem by Davis [Dav98]. Tapia and Ujevic somehow 
extend the investigation made by Ferraris et al. on the universality property of 
Einstein spaces in the metric-affine setting [TU98]. As we shall see in Subsection 
2.3.2, their claim that "it is also possible to incorporate a Weyl vector field" and 
"therefore the presence of an anisotropy" proves to be wrong in the light of the 
constrained metric-affine method in Weyl geometry. 

In this subsection we first give the field equations that are derived from the 
quadratic Lagrangian densities (2.17) via the Einstein-Palatini variational principle 



^^In that respect it is strange that none of the most recent works using the Einstein-Palatini 
method refers to Buchdahl's article. 
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[Buc79]. Next we analyse in greater detail the Einstein-Palatini variation of the 
nonlinear Lagrangian density (2.22). We also indicate how Buchdahl's work can be 
extended to nonlinear Lagrangians of the type f{RahP°'^) [Mir97]; in particular, our 
analysis sheds another light on the 'universality property' most recently advocated 
by Ferraris et al. [FFV94, BFFV98] and by Tapia and Ujevic [TU98]. This impels us 
to present the Lagrange-multiplier version of the Einstein-Palatini method, namely 
the constrained first-order formalism (in Subsection 2.2.3), the consequences of 
which will be drawn in Section 2.3. 

Quadratic case 

Applying the Einstein-Palatini method of variation to the quadratic Lagrangian 
densities (2.17) one obtains the following sets of equations :^^ 

<5£i ^i?(i?(ab) - \R9ah) = 0, (2.24a) 

Ve(i?0"^)=O; (2.24b) 

62.2 -^RaRbc + R'aRcb " ^RcdR^^Qab = 0, (2.24c) 

\/^y\''b ^ Q. (2.24d) 



^^3 ^'^RcdaR" he - RacdeRh "" + R'^adeRcb "" " ■^RcdefR'' ''■'gab — 0, (2.24e) 

^^^Jab)d ^ Q_ ^2.24f) 



For each Lagrangian density the second set of equations has been obtained after 
partial integration, use of the Palatini equation (2.7), and elimination of a trace. 
These equations have been first given by Stephenson [Ste58, Ste59] and Higgs 
[Hig59]. Their important features are threefold: 

1. They are conformally invariant, that is invariant under conformal transfor- 
mations gab = ^^(x)5'afe, where J7^(x) is an arbitrary, strictly positive, scalar 
field. This can be easily understood on account of the fact that the affine 
connection is totally unrelated to the metric and because the Lagrangian den- 
sities from which they are derived are quadratic in the curvature. In a purely 
metric theory this property holds only for the Lagrangian density (2.19). 
Furthermore, Higgs was the first who showed that the solutions of equations 
(2.24) corresponding to -R-squared and Ricci-squared Lagrangians are confor- 
mal to Einstein spaces with arbitrary cosmological constants [Hig59, Buc79]. 



^^Only the symmetric part of the Ricci tensor has been retained in £2; cf. the remark on page 
25. 
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2. They are second-order differential equations. This is due to the first-order 
nature of the Einstein-Palatini variation, which crumbles those double co- 
variant derivatives of the curvature tensors that typically occur in purely 
metric variations. In particular, they are obviously not equivalent to the 
fourth-order field equations (2.20) obtained via purely metric variations. 

3. They do not yield the Levi-Civita connection as is the case for the linear 
Hilbert-Palatini Lagrangian and therefore their solutions are not Riemann 
spaces. However, in some specific cases it turns out that the underlined 
manifold is a 'disguised' W4, which can be brought onto an Einstein space 
upon a suitable conformal transformation (cf. the discussion on the nonlinear 
case below). 

Remark. The Ricci tensor is not necessarily symmetric as in the Riemannian con- 
text; on writing R(^ab) '■= Pab and R[ah\ '■= Qab we thereby see that to the Lagrangian 
density £2 in Vi there corresponds a whole family of Lagrangian densities in A4, 
namely £2,0 = \/—gRah{P"'^ + OiQ"'^), where a is an arbitrary constant. In other 
words there are no selection rules that enable to choose a = 0, i.e. £2, in the 
Einstein-Palatini variational principle. On the other hand, there are no a priori 
reasons whatsoever to discard other acceptable Lagrangian densities compatible 
with the Einstein-Palatini prescriptions. For example, one could start with that 
quadratic combination involving only the antisymmetric part of the Ricci tensor, 
namely 

ab 



-^2 •— V-g P-abQ 

However, in that case the ensuing field equations impose only four conditions upon 
the forty E's and leave the metric tensor totally undetermined [Buc79]. This dis- 
concerting situation also happens with the Ricci-squared Lagrangian: Equations 
(2.24c) and (2.24d) are fulfilled by any Ricci-flat, i.e. Rab = 0, affine space A4 
whereas the metric tensor remains quite arbitrary. This sort of things never hap- 
pens with purely metric variations, where the geometry is Riemannian from the 
outset. As Buchdahl, we claim that this degree of arbitrariness reflects discredit 
on the use of Einstein-Palatini variations in a theory of gravity. 



N 



onlinear case 



Applying the Einstein-Palatini method of variation to the metric-affine analogue 
of the nonlinear Lagrangian density (2.22), that is 



gflg'^'RabiT)), 
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one obtains the following set of equations: 

f'R{ab) - \fgah = 0, (2.25a) 

Vc(/V')=0. (2.25b) 

Obviously, they differ from the Euler-Lagrange derivative (2.23). To see what these 
equations imply, we first expand equation (2.25b), viz. 

/'V,0"^ + /"g'^Vei? = 0. 

Making use of the formulae V a\f--g = da^J—g — yJ—gVa and V aR = daR and 
assuming that /' / we may write the latter equation as 

9, (In V^) + (In f)'d,R - T\^ g^" + d.g'^' + T^^' + T^.^g'^'' = 0. (2.26) 

Contracting with gab we see that Tc is a gradient: 

r, = dc{in,/^)+2{inf'ydcR. 

This implies that the Ricci tensor is symmetric since R^ab] = ^[0^6]. Replacing 
Tc in equation (2.26) by the value given above and after little manipulation one 
obtains 

dc if gab) = f {riagdb + ri,gad) ■ (2.27) 

This suggests to define a conformally related metric with conformal factor /'; indeed 
in that case the latter equation becomes 

dcgab = riJdb + ri^gad, (2.28) 

thereby implying that the covariant derivative of the new metric gab with respect to 
the connection Vc vanishes: hence the connection T'^^f^ is the Levi-Civita connection 
of the metric gab- On the other hand, equation (2.26) is equivalent to 

Ve5"' = Ve(ln/>'^^ (2.29) 

which seems to show that the Einstein-Palatini variational principle has selected a 
W4, i.e. a Weyl space, with Qa = Va(ln/') as the Weyl one-form. However, this is 
not a W4 but a Riemann space with an 'undetermined gauge' [Sch54, p. 134]: Under 
a conformal transformation gab = ^ gab in a W4 the Weyl one-form transforms as 
Qa = Qa — 2Va (in Q) ; if Qa is a gradient, it is always possible to choose the 
conformal factor in order to have Qa = 0, which amounts to consider a Riemann 
space — this is the case here since we can 'gauge away' the spurious Weyl one-form 
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by choosing precisely Q"^ := f as the conformal factor. On the other hand the 
analysis of the field equation (2.25a) is more straightforward. Taking its trace we 
find 

f'{R)R = 2f{R). (2.30) 

This latter equation is identically satisfied (up to a constant rescaling factor) by 
the function /(-R) = R , as is expected from the fact that any quadratic invariant 
metric-afiine Lagrangian density is conformally invariant (cf. Point 1 on page 24). 
In that specific case the field equations (2.25a) reduce to 

Rab - ^Rgab = 0, (2.31) 

provided that f'{R) / 0, and the conformal gauge degree of freedom is borne by 
the scalar curvature, which is assumed to be strictly positive. Hence, under an 
arbitrary conformal transformation gab = k'^Rdab, with k'^ an arbitrary positive 
constant, the equation (2.31) becomes 

Rab - ^Rgab = 0, (2.32) 

where the conformal scalar curvature is constant, viz. R = /c~^. Indeed, the equa- 
tion (2.32) implies that the underlined manifold is an Einstein space with an arbi- 
trary cosmological constant. 

Remark. It is important to realise that, once the undetermined gauge has been re- 
moved, the conformal invariance is broken: The original 'fake' Weyl space has been 
replaced by an Einstein space — and Einstein spaces are of course not conformally 
invariant. In particular, the connection T'^^b being the Levi-Civita connection asso- 
ciated with the conformal metric gab, it is totally inaccurate to perform an inverse 
'conformal transformation' in which the connection remains frozen, as was the case 
in the departing counterfeit Weyl space. Strictly speaking it does not make any 
sense to go back to the original — illusory — space. In that respect what is found 
in the literature is either wrong (see e.g., Buchdahl's conclusion after his relation 
(3.7) [Buc79]) or misleading (see e.g.. Proposition 1 of Ferraris et al. [FFV94]). 

If on the other hand one excludes the purely quadratic case and assume that 
the function fyR) be given prior to the variation, then the trace equation (2.30) 
can be regarded as an algebraic equation to be solved for R. Denoting the resulting 
roots pi, p2, ■ ■ ■ one obtains a whole series of Einstein spaces, each characterised by 
a distinct constant scalar curvature. This situation was analysed by Ferraris et al. 
[FFV94] who extended Buchdahl's investigation [Buc79]. However, if it happens 
for some root pi to be such that f'{pi) = 0, then the trace equation (2.30) entails 
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that f{pi) = as well. In that instance the field equations (2.25) leave both the 
metric and connection completely undetermined.^^ 

As a special case of the nonlinear theory, one can also apply the Einstein- 
Palatini variation to scalar-tensor Lagrangian densities. Consider for instance the 
Brans-Dicke action [BD61]: 

S = fd^n./^ (fg'^^Rab - 4c^ 5"'V,(/)V„/.) . (2.33) 

The appropriate conformal factor for which the connection F'^^^ is the Levi-Civita 
connection associated with the conformal metric '(jab turns out to be the Brans- 
Dicke scalar field. Rather than transforming the field equations one can first trans- 
form the Brans-Dicke action and thereafter perform the Einstein-Palatini varia- 
tion. The resulting Lagrangian density is 

£bd = V^ (i? - ^ T''da(t> db^) . (2.34) 

This corresponds to the 'unit-transformed version' of Brans-Dicke's theory in which 
the gravitational constant is effectively constant and rest masses are varying ('Ein- 
stein frame') [Dic62].^^ As was first noticed by Van den Bergh, the peculiarity 
of the Brans-Dicke action is responsible for a curious fact: Although the purely 
metric variation and the Einstein-Palatini method are not equivalent when they 
are applied to the action (2.33), they yield the same dynamics if they are applied to 
the conformally transformed action constructed from (2.34); therefore the Einstein 
frame is singled out as the unit system in which both variational principles are 
equivalent [BerSl].^^ 

Creneralisations oi tne nonlinear case 

The above analysis can be performed on more general Lagrangian densities. By a 
completely analogous procedure one finds Einstein spaces for the following classes 
of Lagrangians: 

• L = f{RabP"'^), where Pab is the symmetric part of the Ricci tensor Rab 
[BFFV98]; 



^■^This situation is best illustrated with the Lagrangian L = R" for n > 2. Another interesting 
example is provided by the Lagrangian L = aR + bR + c with a, 6, and c constant: Applying the 
Einstein-Palatini variation one obtains qualitatively different conclusions according to the specific 
values of the constants a, b, and c; in particular, when b^ — Aac there is only one condition, i.e. 
R = constant, and nothing else is determined. 

^'The only difference lies in the coupling constant to, which is equal to the resettled Brans-Dicke 
coupling constant, i.e. a; = Wbd — |. 
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See also Burton and Mann's recent investigation [BM98b]. 
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• L = /(R), where R is any higher-order scalar constructed from the quantities 

<7'^^Peb(r) [TU98]; 

Firstly, we briefly consider the former case.^^ Let us define r := RabP"'^- Vari- 
ation of the corresponding action yields the following g- and F-equations: 

f'PacP% - \f9ab = 0, (2.35a) 

Vc(/'r')=0. (2.35b) 

From equation (2.35), after some manipulation, we conclude that Fq is a gradient; 
hence the Ricci tensor is symmetric and Pat can be replaced by Rab everywhere. 
Defining the 'reciprocal' tensor Pab (cf. [Buc79]) by PacP'^^ = S^ and setting p := 
det Pab we observe that the new metric that is defined by 

^ • ^ ^ -Pab (2.36) 



V-9 
or by 

nyab 

^"' = fpU-2 (2.37) 

induces the connection F'^^^ as its associated Levi-Civita connection. Expressing 
the field equations (2.35a) in the conformal frame described by the metric gab we 
find anew an Einstein space, viz. 

1 f / — 
^'^b = I ^f,ypi/29ab- (2.38) 

In the conformally invariant, i.e. quadratic (/' = 1) case it is easy to show that the 
conformal factor can once again be chosen to remove the undetermined gauge that 
characterises the illusory Weyl space in the original frame. This conclusion also 
holds for the n-dimensional conformally invariant Lagrangians studied by Tapia 
and Ujevic [TU98]. 

For the latter instance in the list above, varying the corresponding action with 
respect to the metric and connection one obtains 

- 2f9ab - f Pa"" ""Rbcde + fP'^adePcb ^ + '^fP'^daePc b"" = 0) (2.39a) 

Vi/'in/^)'^) = 0, (2.39b) 



"^See Miritzis's Ph.D. dissertation for a thorough investigation [Mir97]. 
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but in contrast to the previous cases there exists no natural way to derive a con- 
formal metric gab from the field equation (2.39b) unless the Weyl tensor vanishes 
[Dav98]. 

For completeness we mention that the inclusion of matter leads in general to 
inconsistencies. Moreover, in the nonlinear case the stress-energy tensor no longer 
satisfies the conservation equation; still, it is possible to define a new stress-energy 
tensor that is conserved but the physical interpretation of this generalised conserva- 
tion law can be put in doubt since in the linearised theory for instance the ensuing 
equation of motion for test particles contains an additional term, which disagrees 
with Newton's law (see [HB93]). However, most recently, Borowiec and Fran- 
caviglia have shown that for those classes of nonlinear gravitational Lagrangians 
that give Einstein spaces it is also possible to derive the Komar expression for the 
energy-momentum complex [BF98]. 

In order to avoid the difficulties inherent to the Einstein-Palatini variational 
method, we now consider its Lagrange-multiplier version. This modification of the 
metric-affine variational principle enables one to choose from the outset the type 
of geometry one wants to deal with. As regards the path-integral approach to 
quantum gravity, for instance, this seems more consistent. 

2.2.3 Constrained first-order formalism 

As we have indicated in the previous subsections, when one applies the Einstein- 
Palatini variational principle one is often confronted with difficulties: Any depar- 
ture from the Hilbert-Palatini Lagrangian density £„? = 9°'^ Rab(^ , dF) brings 
forth cumbersome features. Nevertheless, the main shortcoming of the Einstein- 
Palatini method is not that the ensuing field equations are typically not equiva- 
lent to those that are derived from the 'same' — in fact, they are different, since 
defined in different functional spaces — Lagrangian densities via the purely metric 
Einstein-Hilbert variational method but stems from the fact that it leads to certain 
indeterminacies, thereby vitiating its usefulness in a gravitational theory. 

However that may be, our purpose is not to maintain the status quo; but, if 
one's aim is to examine generalisations of the orthodox variational principle, one 
ought to be careful and to proceed gradually, conveyed by the guiding principle 
that one should recover known results as special, limiting, cases of the extended 
framework. We shall prove in this subsection that one consistent way of performing 
this programme is that which proceeds with the Lagrange-multiplier version of the 
Einstein-Palatini variational principle, referred to hereafter as the constrained first- 
order method of variation [CMQ97]. 

The use of Lagrange multipliers in a variational principle is advocated when- 
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ever one wants to impose some constraint on the independent variables, for a 
straight enforcement is in general not consistent with the variation. In the context 
of gravity this was clearly stated for the first time — as far as we are aware — in 
Tonnelat's book on Einstein's attempts towards a unified theory of the 'funda- 
mental fields' [Ton55]. In the context of Riemannian geometry this amounts to 
add to the Lagrangian density the compatibility condition as a constraint with La- 
grange multiplier; next, one varies the resulting action with respect to the metric, 
connection, and Lagrange multiplier, considered as independent variables. When 
the constrained first-order method is applied to the Hilbert-Palatini Lagrangian, 
the Lagrange multiplier identically vanishes as a consequence of the field equa- 
tions [Ray 75]. This result strengthens the interpretation of the equivalence of the 
Einstein-Hilbert and Einstein-Palatini variations in vacuum general relativity as 
a pure coincidence. Applied to higher-order gravitational Lagrangians the con- 
strained method proves to be technically much simpler than the purely metric 
equivalent variational method; this was demonstrated by Safko and Elston [SE76] 
and also by Ray [Ray 78]. Nevertheless, the interesting applications of the con- 
strained first-order method are those that are defined in the extended geometrical 
framework of MAG theories of gravity (see [HMMN95] and references therein). 
Consider a four- dimensional space-time manifold M endowed with a Lorentzian 

o 

metric gab and its associated Levi-Civita connection, denoted hereafter Vc or, in- 
differently, {a'l,}-^^ Contradistinctively to the purely metric variation of Subsection 
2.1.1 we want to relax the compatibility condition and allow for non-metricity dur- 
ing the variation; however, since we aim at recovering a specific type of space — e.g., 
Riemannian — after the variation, we must add to the original Lagrangian density 
£ (now considered as a metric-affine functional of the metric, connection, and pos- 
sibly matter fields ^p) the constraint 

£e(g,r, A) = y^A,'^^(r,, - {;j - CU), (2.40) 

where A'^^^ are Lagrange multipliers and C^^^ is the difference tensor between the 
arbitrary symmetric connection V^^j^ and the Levi-Civita connection {^^^}. For 

o 

instance, in Riemannian geometry, i.e. VcQab = 0, (2.40) takes the form 

Le(g,r,A)=A,'^^(r,,-{^^J), (2.41) 

whereas in Weyl geometry, i.e. VcQab = -QcOab, it is 



Lc(g,r,A)=A,'^^ 



1 



ab 



{ab}-oyiQb9ad + Qagdb-Qd9ab) ■ (2.42) 



^^The symbol V henceforth refers to the independent connection that is varied in the con- 
strained metric-afRne variational principle. 
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The resulting metric-afRne constrained action 



S 



d^f^ 



M 



£(g,r,v) + i:c(g,r,A) 



(2.43) 



must be varied with respect to the independent fields g"'^ , T'^^^, A^''^, and V'- Vari- 
ation with respect to the metric yields the g-equations 

(5£ 



6g' 



ah 



+ *B 



ah 



0, 



where the tensor density ^ab is defined by 



53 



ab 



1 



-2 V^V'=(A6ac + Kcb - ^cab)- 



-9 Bab ■ 

Variation with respect to the connection gives the F-equations 

5L 



6r 



ab 



+ A, 



0. 



(2.44) 



(2.45) 



(2.46) 



Variation with respect to the matter fields tp yields their respective equations of 
motion. Finally, variation with respect to the Lagrange multipliers restores the 
constraint (2.40). 

Consider the specific case of Riemannian geometry. ^^ Solving explicitly the 
F-equations (2.46) for the multipliers A^"* and substituting back the resulting ex- 
pression into the g-equations we should obtain field equations equivalent to those 
derived by 'Hilbert varying' the metric unconstrained action functional. This is ex- 
pected by construction of the constrained variational method and can be checked 
explicitly on the previous higher-order Lagrangians densities. We first write down 
the resulting g- and F-equations together with the actual values of the tensor i?"* 
for the Lagrangian densities (2.17):^° 



2^ 



A 



B 



ab 



ab 



2RR 



ab 



B 

-.ad A 



ab 



{2g''^8t - g'^^'Sl 
-2g''^UR+2V^'V"'R, 



= 0, 



(2.47a) 



for the Lagrangian density £i 
1 



-g^^RcdR"^ - R'"^RI - R^'^Rd + B 



A 
B 



ab 



ab 



2VrR 



5tVdR 



db 



S^VdR 



ab 
ad 



-Di?"* + 2VcV''i?"^ - g''''VdVcR^'^ 



(2.47b) 



^^The generalisation to Weyl spaces will be considered in Subsection 2.3.2. 
^°These formulae correct Safko and Elston's, which contain many misprints [SE76]. 
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for the Lagrangian density £2; and 



-g'^^RcdefR^'^'^^ - 2R''^'^'^R^^^ + S"^ = 0, 



abd I r)V7 D bad 



hr = 2VdRc + 'i^dRc, (2.47c) 

B"-^ = AVdVcR"^'"^, 

for the Lagrangian density £3. In the nonhnear case (2.22) we obtain hkewise: 



-/5"^ - f'R^"^^ + B"^ = 0, 

^ab ^ 1 ^^gab^d _ gad^b^ _ gdb^a^^j'^ (2.48a) 

^ab ^ -g^b^f + VV''/'. 

It is straightforward to obtain the correspondence with the Euler-Lagrange deriva- 
tives (2.20) and (2.23) respectively by taking into account the Riemannian con- 
straint and upon substituting in the g-equations, within each set of (2.47) and 
(2.48), the respective expressions of B"-'^. The outcome is: 

-c/"^i?2 _ 2RR^b _^ 2VV"i? - 2g''''aR = 0, (2.49a) 

-g'^'^RcdR^'^ - 2R'"""^Rcd + VV"i? - Di?"^ - -g^^Di? = 0, (2.49b) 

^''^RcdefR^'^^^ - 2R^'^^^R^ae " + ^VdVcR'""^'' = 0, (2.49c) 

f'R'^b _ K-gab _ yaybj^f ^ ^ab^j, ^ q_ (2.49d) 

Strictly speaking, in comparison with the usual Hilbert variation, in all cases con- 
sidered so far using the constrained first-order formalism, one starts from a different 
Lagrangian density, defined in a different functional space, follows a different vari- 
ational method but nevertheless ends up in an equivalent set of field equations. 
Bearing this in mind, one can trace back this equivalence — which is not a mere 
formal coincidence — from the fact that all our Lagrangian densities are diffeomor- 
phism covariant. 

All previous cases can indeed be considered as specialisations of a very general 
Lagrangian n-form constructed locally as follows, 

L = I'igab, ^a^gab, ■■■, ^ (a-, ■ ■ ■ '^ak)9ab, V', V^i V', • • • , "^ {ai ■ • • Va,)V', l) ■ (2-50) 

More specifically, L is a functional of the dynamical fields gab, V') finitely many 
of their covariant derivatives with respect to to Vc, and also other 'background 
fields' collectively referred to as 7. Referring to '(/ and V' as ^4>\ L is called /- 
covariant, f G Diff(M), or simply diffeomorphism covariant if L(/*((/))) = f*'L((p), 
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where /* denotes the induced action of the diffeomorphism / on the fields 4>. (Note 
that this definition excludes the action of /* on V or the background fields 7.) It 
immediately follows that our previous Lagrangians satisfy the above definition and, 
as a result, are diffeomorphism covariant. It is a very interesting result, first shown 
by Iyer and Wald [IW94] that if L as given in (2.50) is diffeomorphism covariant, 
then L can be reexpressed in the form 

o 00 

L = L(5afe) Rbcde, '^aiRbcde, ■ ■ ■ , '^ (ai ■ ■ ■ '^am)Rbcde, 

V',Vai^,...,V(,^...V„^)^), (2.51) 

o 

where Rabcd is the Riemann curvature of Vc and m = max(A; — 2,1 — 2). Observe 
that everything is expressed in terms of the Levi-Civita connection of the metric 
tensor and also that all other fields 7 are absent. Applying Iyer and Wald's theo- 
rem to our Lagrangians we immediately see that we could have reexpressed them 
from the outset in a form that involves only the Levi-Civita connection and not 
the original arbitrary connection V, and vary them to obtain the corresponding 
'Hilbert' equations. As we showed above, we arrived at this result by treating the 
associated Lagrangians as different according to whether or not they involved an 
arbitrary symmetric or a Levi-Civita connection. 

Remark. This does not mean though that any metric-afhne Lagrangian could be 
reexpressed as a purely metric Lagrangian. Here, we were allowed to apply Iyer and 
Wald's theorem because we started from a metric Lagrangian and by construction 
replaced it with a constrained metric-affine Lagrangian, where the connection is 
not in fact dynamical. 

2.3 Conformal structure of nonlinear gravitational La- 
grangians 

2.3.1 Conformal equivalence properties 

There is a huge literature involving conformal equivalence properties in the con- 
text of gravity theories. ^^ In this subsection we are mainly interested in recalling 
the well-known conformal equivalence property of nonlinear theories of gravity 
with Einstein's theory and additional scalar fields; in the next subsection we shall 
demonstrate how this classical result can be extended to Weyl geometry. We do 
not intend to discuss the 'physicality' issue arising from the aforementioned con- 
formal equivalence, that is, the problem of determining which metric amongst the 

^^For a very recent and thorough review we refer the reader to the work of Faraoni, Gunzig, 
and Nardone [FGN98] and references therein. 
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equivalence class of conformally related metrics is the physical one. Since the early 
criticisms of Brans [Bra88] , this question has raised a lot of controversies and mis- 
interpretations. In the context of stringy gravity it seems to be a crucial — albeit 
often skirted — and fairly intricate issue (see however Dick's constructive criticisms 
[Dic98]): One must indeed decide whether the rank- two tensor gab — which defines 
the 'Jordan', or 'string frame' — occurring in the effective actions of string theory is 
to be interpreted as the physical metric tensor or rather as a unifying object confor- 
mally related to the genuine metric tensor gab of the space-time through additional 
scalar fields. In that respect most theoretical arguments seem to favour the (con- 
formal) 'Einstein frame' as the relevant set of physical variables; this would also 
be true for a larger class of alternative theories of gravity including nonlinear theo- 
ries of the f{R) type, generalised scalar-tensor theories, and Kaluza-Klein theories 
after compactification of the extra dimensions: All these theories are dynamically 
equivalent — in the sense that their respective solution spaces are isomorphic to each 
other — to general relativity with scalar fields, which are still weird, hypothetical 
objects [Bra97]. 

The conformal equivalence of vacuum nonlinear gravity theories with general 
relativity plus a scalar field (see, e.g., [BC88]) can be proved within the extended 
framework of generalised scalar-tensor theories — which embody all theories involv- 
ing scalar fields besides the metric — , described by the following generic Lagrangian 
[Hwa97]: 

L = ^f{4>, R) - ^Lo{(t>) g'^'VacPVbcl) - V{ct)). (2.52) 

The easiest way to proceed is to perform the appropriate conformal transformation 
on the Lagrangian itself, not on the field equations. Indeed, on defining the confor- 
mal factor as fi^ = /' = exp(A /IV') it is straightforward to obtain the conformally 
transformed Lagrangian corresponding to (2.52), namely 

L = Ir- \j,T'^a(p^b<t^ - ^s'^'v.vv,^ - V{4>, V), (2.53) 

where the potential term is defined by 

V[<t>. ^) := ^^ {Rf -f + 2V). (2.54) 

The original generalised Lagrangian (2.52) has thus been 'cast' into the Einstein- 
Hilbert Lagrangian with an additional scalar field V' and an appropriate potential 
term V{4),4')- In most cases, introduction of a new scalar field 4> reduces the 
expression (2.53) to the final form 

L = \r- ^5"'V JVb0 - V{4>), (2.55) 
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provided that this new scalar field satisfies 



^d(^2_^dV'2. 



As a special case of the generic situation above, the conformally transformed 
nonlinear /(-R) Lagrangian yields the field equations 



Gab = ^a<P^b<P - -^dab 



Vc(pV^(l)) - 2V{(I)) , (2.56) 



with potential 



V{ct>) = ^{fy'[Rf'{R)-f{R)], (2.57) 



where it should be understood that R is superseded by cj) through the inverse 
function to f'{R)- 

The most important consequence of the conformal equivalence property in the 
context of higher-order gravity is that the original fourth-order field equations can 
be reduced to second-order equations, thereby simplifying the analysis of non- 
linear gravity theories (see, e.g., [Cot90, Mir97] and references therein). On the 
other hand, the dynamical equivalence of nonlinear theories and general relativ- 
ity plus additional scalar fields can be demonstrated by performing a Legendre 
transformation on the original set of variables (see, e.g., [Sok97] and references 
therein). This procedure — which amounts to transform the starting nonlinear the- 
ory into a scalar-tensor theory — was first used by Teyssandier and Tourrenc to 
solve the Cauchy problem for those theories [TT83] and is completely analogous 
to the generalised Ostrogradsky prescriptions studied in Chapter 3 (cf. p. 103); 
it can be generalised to Lagrangians that are functions not only of R but also 
d'^R for /c G N [Wan94]. Yet, it should be clear that the aforementioned Legen- 
dre transformation — sometimes referred to as a Helmholtz formalism [MS94] — does 
not play the role of the conformal transformation: Nonlinear gravity is dynamically 
equivalent to scalar-tensor gravity upon introducing additional scalar fields and the 
latter is then conformally equivalent to general relativity with one more scalar field 
[Wan94].32 

2.3.2 Generalised conformal structure in Weyl geometry 

For the more general higher-order Lagrangians of the form f{q) where q = R, 
RahR"'^ , or RahcdR"'^'^'^ and where / is an arbitrary smooth function, the field 



^^This slight difference is not merely conventional; it appears more clearly when one takes into 
account the boundary terms occurring in the respective actions. 
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equations obtained via the metric-connection formalism are of second order (cf. 
equations (2.39)) whereas the corresponding ones obtained via the usual metric 
variation are of fourth order. At first glance this result sounds very interesting 
since one could foresee that it would perhaps lead to an alternative way to 'cast' 
the field equations of these theories in a more tractable, reduced form than the one 
that is usually used for this purpose, namely the conformal equivalence theorem 
(cf. previous subsection). In this way, certain interpretational issues related to the 
question of the physicality of the two metrics associated with the conformal trans- 
formation would perhaps be avoided. Unfortunately, as indicated in Section 2.2, 
other difficulties arise when one uses the Einstein-Palatini method of variation, 
thereby vitiating its reliability as an alternative method to, for instance, reducing 
the complexity of the gravitational field equations. At the end of this subsection, 
we shall see that the origin of these troubles lies in the fact that the metric-affine 
variational principle is in fact a degenerate case of the constrained first-order for- 
malism, therefrom unable to cope with more general geometrical settings such as 
'true' Weyl spaces. 

We are now interested in investigating the consequences of applying the con- 
strained first-order formalism, defined in Subsection 2.2.3, to the case of Weyl 
geometry. We reiterate that a four- dimensional Weyl space W4 is an affine space 
L4 endowed with a Unear symmetric and semi-metric connection, that is 



"^cga 



-Qcg< 



ab: 



(2.58) 



where Qc is the Weyl one-form that characterises the geometry. The Weyl con- 
straint is (2.42); for convenience we recall its explicit form: 

1 



ic g,r,A 



A 



ab 



ab 



[ab] ~ od" {QbQad + QaQdb " Qd9a 



(2.59) 



We apply the constrained first-order formalism to the nonlinear Lagrangian L = 
f\R) in vacuum, making use of the contracted Palatini equation (2.8) and the 
following useful formulae: 



^Alb) 



^gCab 



25"^[Vfe((55ad) + Va{5gdb) - Vd{6gab)] 
1 



lz{Qb9ad + QaQdb - Qd9ab)^9 



cd 



:{Qbgad + QaQdb - QdQab)5g 



cd 



(2.60a) 



+ -^{Qb^Qad + Qa^Qdb - QdSgab)Q 



cd 



Variation of the nonlinear action 



S 



d^fl 



M 



gf{g'''Rab{r))+£c{g,r,A) 



(2.60b) 



(2.61) 
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with respect to the Lagrange multipliers A^"^ is trivial and recovers the definition of 
the Weyl connection in terms of the Levi-Civita connection and the Weyl one-form, 
that is 

r'ab = (a J + ^9"'^{Qb9ad + QaQdb " Qdffafe) • (2.62) 

Taking into account that Vq-^— 5 = —2Qa\/—g, variation with respect to the 
metric g"''' yields the g-equations 

f'R(ab) - \f9ah + Bab = 0, (2.63) 

where Bab is defined by (2.45). Eventually, variation with respect to the connection 
brings forth the explicit form of the Lagrange multipliers, namely 

A,"^ = 5(^(Q'^)/' - V^)/') - g^\Qj' - VJ'). (2.64) 

Substituting back the latter result into equation (2.45) we find the expression of 
the tensor Bab, that is 

Bab = "^Qia^b)!' - ^{aSb)f' + f'^{aQb) " f'QaQb 

- gab (2QcV7' - Q^f - □/' + /V^Qe) . (2.65) 

Inserting this result into equation (2.63) we obtain the full field equations for the 
nonlinear Lagrangian L = fyR) in the framework of Weyl geometry, viz. 

f'R{ab) - \f9ab - VaVbf + ^afeD/' = M,fe, (2.66) 

where Mab is defined by 

Mab := -2Q(,V6)/' - f'V^aQb) + f'QaQb 

+ gab (2QcV7' - Q^f + f'^Vc) ■ (2.67) 

Note that the degenerate case of vanishing Weyl one-form coincides with the fa- 
miliar field equations (2.23) obtained in the Riemannian framework. 

As discussed in the previous subsection, those equations are conformally equiv- 
alent to Einstein's equations with a self-interacting scalar field as the matter 
source. We henceforth aim at generalising this important property in Weyl ge- 
ometry. To this end, we define the metric gab conformally related to the metric 
gab with /' as the conformal factor. Owing to the fact that the Weyl one-form 
transforms as Qa = Qa — Va(ln/') and taking into account the formulae Va = Va, 
n = g"'^'Va^b = {f')~^^, the field equations (2.66) become in the conformal frame 
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where the tensor Mab is given by 



Mab = f'QaQb - f^iaQb) " V„Vfe/' 

+ ?a6(/'V^4-/'Q' + □/')• (2-68) 

Introducing the scalar field if := In f and the potential V{^) in the 'usual' form, 



VIZ. 



V{^) = ^{f')-\Rf'{R)-f{R)], (2.69) 

we can rewrite the field equations in the final form 

Gab = MabiQ)-gabV{ip), (2.70) 

where we have set 

Gab = R{ab) - -^Rgab (2.71a) 

and 

MabiQ) = QaQb - yiaQb)+9ab{y'Qc " Q') • (2.71b) 

Equivalently we could have obtained the field equations (2.70) from the correspond- 
ing conformally transformed action, via the constrained first-order formalism. 

The field equations (2.70) are Einstein's equations for a self-interacting scalar 
field matter source with a potential V{if) and a source term MabiQ) depending on 
the Weyl one-form Qa- If the geometry is Riemannian, i.e. Qa = 0, one recovers 
the standard conformal result. This will be the case only if the original Weyl 
one-form is a gradient, i.e. Qa = Va<I>, since in that particular case it can be 
gauged away by the conformal transformation gab = (exp ^)gab (cf. the discussion 
on page 26). This actually was the case of the Einstein-Palatini variational method 
applied to the Lagrangian L = f{R), where the Weyl one- form turned out to be 
Qa = Va(ln/'). This fact shows unambiguously that unconstrained metric-affine 
variations of the Einstein-Palatini type cannot deal with a Weyl geometry and 
correspond to a degenerate case of the constrained first-order method: The field 
equations obtained from the former can be recovered within the constrained setting 
simply by choosing a very special form of the Weyl one-form [Que97] that makes 
the Weyl space degenerate in a Riemann space. As a direct consequence of our 
investigation, Tapia and Ujevic's claim that, upon making use of the Einstein- 
Palatini variation, it is possible to incorporate a Weyl vector field — and therefrom 
account for the would-be observed anisotropy in the universe — is invalidated since 
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this vector field describes an undetermined gauge in a Riemann space and can 
therefore always be eliminated [QMC99]. 

Let us summarise the results obtained in this subsection. Our analysis has re- 
vealed that a consistent way to investigate generalised theories of gravity, without 
imposing ab initio that the geometry be Riemannian, is the constrained first-order 
formalism. Applications to quadratic and /(-R) Lagrangians in the framework of 
Riemannian and Weyl geometry show that the unconstrained Einstein-Palatini 
variational method is a degenerate case corresponding to a particular gauge and 
that the usual conformal structure can be recovered in the limit of vanishing Weyl 
one-form. (The generalisation of the result stated above to include arbitrary con- 
nections with torsion could be an interesting exercise.) 

The physical interpretation of the source term in the field equations (2.66) is 
closely related to the choice of the Weyl vector field Q". However, it cannot be 
interpreted as a genuine stress-energy tensor in general since, for instance, choosing 
Q" to be a unit timelike, hypersurface-orthogonal vector field, the sign oi MabQ"'Q^ 
depends on the respective signs of f'{R) and 'expansion' VaQ""- 

The generalisation of the conformal equivalence theorem opens the way to 
analysing cosmology in the framework of these Weyl nonlinear theories by methods 
such as those used in the traditional Riemannian case (cf. [Cot97]). All the related 
problems could be tackled by leaving the conformal Weyl one-form Qa undeter- 
mined, while setting it equal to zero will eventually lead to detailed comparisons 
with the results already known in the Riemannian case. 
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Chapter 3 

Hamiltonian formulation of 
higher-order theories of gravity 



"And it is right that you should learn all things, both 
the persuasive, unshaken heart of Objective Truth, and 
the subjective beliefs of mortals, in which there is no 
true trust." 

— Parmenides of Elea, On Nature (Peri Physis). 



WITHIN the realm of classical mechanics where it sprang up the Hamilto- 
nian formalism chiefly served the purpose of a tool for tackling dynam- 
ical problems. So far Hamiltonian methods have demonstrated their 
ability to prove profound results — the renowned KAM theory, for example — and 
to solve otherwise intractable problems in various fields of mathematical physics: 
celestial mechanics, ergodic theory, statistical mechanics, and so forth [Arn76]. In 
addition the conceptual framework provided by the canonical formalism has be- 
come a convenient starting point for quantisation. This is the main reason why 
Hamiltonian methods have been applied to gravity as early as 1930 by Rosenfeld 
who constructed a quantum-mechanical Hamiltonian for the linearised theory of 
general relativity — though he did not make any attempt to develop a canonical 
version of the full theory. This last purpose was carried out with the pioneering 
works of Dirac, Bergmann and many others, and ended up as a consistent canonical 
formalism of Einstein's general relativity: the celebrated ADM formalism. (For an 
historical perspective, see [DeW67].) 

Besides the conceptual interest in the canonical version of general relativity, 
more restricted investigations using Hamiltonian methods as technical tools to 
tackle specific problems deserve consideration: Hamiltonian cosmology [Ugg97], 
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relativistic celestial mechanics and black hole physics (see references in [Bei94]), 
Hamilton-Jacobi theory in relativistic [Sal98] and string cosmology [Say97]. 

In this chapter we focus on the Hamiltonian formulation of theories with higher 
derivatives and its application to higher-order gravity theories. We do not address 
technical issues related to the quantisation of these theories; rather we concentrate 
on their classical structure, hoping to show that the canonical formulation can 
provide a very powerful method for reducing the order of the equations of motion. 
Applications of the Hamiltonian formalism to spatially homogeneous cosmologies 
is left to Chapter 4. We firstly review to some extent in Section 3.1 how to deal 
with first-order constrained Hamiltonian systems. Then in Section 3.2 we give 
a detailed analysis on the treatment of higher-order field theories by means of a 
generalisation of the so-called Ostrogradsky construction. Eventually we do attack 
the most important topic of this thesis in Section 3.3, namely the Hamiltonian 
formulation of higher-order theories of gravity. In particular, we exemplify the 
effectiveness of the Ostrogradsky method by building up a canonical version of 
gravity theories described by a Lagrangian that is an arbitrary function of the 
scalar curvature. This conveys us to prove the chief result of this chapter, that is 
the equivalence of the Ostrogradsky Hamiltonian formulation of general relativity 
and the well-known ADM canonical formalism [Que98]. 

3.1 Constrained Hamiltonian systems 

3.1.1 Introduction 

Physical theories of fundamental significance are invariant with respect to some 
group of local symmetry transformations — gauge transformations for Yang-Mills 
theories; space-time diffeomorphisms for gravity. Such theories are generically 
called gauge theories and can be thought of as theories in which the physical sys- 
tem under study is described by more variables than the number of physically 
independent degrees of freedom. The physically prevailing variables are those that 
are gauge invariant or, in other words, independent of the specific local symmetry 
transformation applied on the system. It is an essential characteristic of a gauge 
theory that the general solution to the equations of motion involves arbitrary func- 
tions of time: The local symmetry relates different solutions stemming from the 
same initial conditions. In the Lagrangian formalism this means that gauge theories 
are singular systems (cf. the remark on page 54). 

It is unanimously acknowledged that the most exhaustive and reliable treat- 
ment of gauge systems is that which proceeds through the Hamiltonian formulation. 
Nevertheless it is worthwhile to start from the action principle in Lagrangian form 
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and proceed to the Hamiltonian formulation. Then the very presence of arbitrary 
time-dependent functions in the general solution of the equations of motion im- 
plies that the canonical variables are not all independent: There are conditions on 
the allowed initial momenta and positions. These relations amongst the canonical 
variables are called constraints. As a consequence, all gauge theories are systems 
with constraints — the converse, however, is not true. This is the reason why most 
textbooks on the quantisation of gauge systems proceed firstly to analysing Hamil- 
tonian constrained systems — Henneaux and Teitelboim's book is perhaps the best 
example in that respect [HT92]. (Historically, the classical Hamiltonian formal- 
ism has rather exclusively been considered as the fundamental setting on which 
canonical methods of quantisation are rooted.) 

In contrast with the old approaches in theoretical physics, the aim of which 
was to reduce the number of variables entering in the play, the modern way of 
dealing with fundamental systems consists in introducing more powerful — gauge — 
symmetries while increasing the number of variables to make the description more 
transparent. This philosophy has culminated with the inception of the elegant and 
powerful BRST formalism; see, e.g., [HT92]. 

There exists a striking difference between gauge theories with internal sym- 
metries and those which are generally covariant, that is, with reparameterisation 
invariance: In the former local symmetry transformations are generated by first- 
class constraints; in the latter the Hamiltonian itself is a constraint — the super- 
Hamiltonian. (In most circumstances, generally covariant systems do have a zero 
Hamiltonian; there exist, however, counterexamples to this property [HT92].) This 
raises the issue of interpreting time in generally covariant theories, for one is led 
to the question whether the Hamiltonian generates the dynamical time evolution 
or the kinematical local symmetries as the other first-class constraints usually do. 
In quantum gravity this issue is particularly puzzling since it is intrinsically linked 
to the various ways of foliating space-time in a one-parameter family of spacelike 
hyper surf aces; see, e.g., [AS91, Ish94], and references therein. 

Circa 1950, the classical treatment of constrained systems was carried out by 
Dirac [Dir50, Dir58a, Dir64], Bergmann and collaborators [AB51, BG55]; almost 
instantaneously, Pirani and Schild applied Dirac's methods to the gravitational 
field [PS50]. Dirac himself was mainly concerned in the Hamiltonian formulation of 
general relativity [Dir58b] and his leading efforts were completed in the sixties with 
the work of Arnowitt, Deser and Misner: the famous ADM formalism. Specifically 
they showed how to use the canonical setting to provide a rigorous characterisation 
of gravitational radiation and energy [ADM60, ADM62]. (As compared to the 
gravitational case, the application of Dirac's methods to gauge theories — Maxwell 
electrodynamics and Yang-Mills theory — has not been so hastily initiated; see, e.g.. 
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[Sun82], and references therein.) 

The geometrisation of the Dirac-Bergmann algorithm was achieved in the late 
seventies by Gotay, Nester, and Hinds [GNH78, GN79, GN80]. In the eighties, 
Batlle et al. obtained general proofs showing that the classical Hamiltonian and 
Lagrangian treatments of gauge theories are equivalent [BGPR86]. The Dirac- 
Bergmann theory opened the way up to the quantisation of constrained Hamil- 
tonian systems even though canonical methods were hard to apply to theories of 
physical interest. Two distinct techniques emerged: the Dirac method of quantisa- 
tion, in which the constraints are implemented as operators in Hilbert space, and 
the reduced quantisation, where the superfluous phase space degrees of freedom are 
eliminated before quantising, as is the case in the adm approach. Their equiva- 
lence is still a matter of controversy; see references in Garcia and Pons's article 
[GP97]. The development of path-integral quantisation methods, incorporating 
the constraints in the definition of Feynman path integrals, and the extension of 
the formalism to include fermionic fields — upon the introduction of Grassmann 
variables — brought forth an appreciable advance which reached its apex with the 
advent of the powerful Hamiltonian BRST and Lagrangian antifield formalisms. 

Besides the classical lectures of Dirac [Dir64] there are several excellent reviews 
on the treatment of constrained systems. Some focus more on systems with a finite 
number of degrees of freedom [SM74], others on field theories [HRT76], and some 
on both [Sun82, Gov91, HT92, Wip94, Bur97]. For generally covariant theories 
there exists a good monograph [GT90]. We draw on all these valuable sources 
to briefly summarise the theory of constrained systems, with the main purpose of 
defining the basic concepts and setting the notations that will be used throughout 
this chapter. 

3.1.2 A short summary of Dirac-Bergmann theory 

Action principle in Lagrangian lorm — Singular Lagrangians 

Consider a conservative holonomic dynamical system the configuration of which at 
any instant of time is specified by K independent generalised coordinates q^ for 
A; = 1, . . . , -fC; its time evolution can be derived from an action functional. 



S[l]= r L{qu{t),qu{t))dt, (3.1) 

Jtn 



constructed from an appropriate Lagrange function L(g, q) that depends explicitly 
on the generalised coordinates and their velocities;^ it is understood that to any 

^At this stage we restrict the discussion to first-order Lagrangians with no explicit time de- 
pendence. Systems with higher derivatives are dealt with in Section 3.2. Moreover we assume 
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path 7 = Qkit) in configuration space that takes the initial and final values qkito) 
and qk(ti) one associates the value ^[7] of the action, which is given by the integral 
(3.1). The classical motions of the system are those that make the action (3.1) 
stationary under variations Sqk{t) of the generalised coordinates Qk that vanish at 
the endpoints to, h. The necessary and sufficient conditions for the action (3.1) to 
be stationary are the Euler-Lagrange equations 



d fdL\ dL 
dt\dqkj oqk 



(3.2) 



We can write equations (3.2) in more detail as 



K 

^fc('7,'7)- J] VFH(g, 9)^-^ = forfc = l,...,K, (3.3) 

1=1 

where we have introduced the quantities 

V.(g,g):=|^-f;^, (3.4a) 

9Qk ^^ dqidqk 

Wki{qA)-=4^- (3.4b) 

' dqidqk 

We immediately see from expressions (3.4) that all equations (3.3) are second-order, 
linearly independent equations, provided the Hessian matrix can be inverted, i.e. 
det W j^ 0. Then the accelerations ijk at a given time are uniquely determined 
by the positions and the velocities at that time and the Lagrangian is said to be 
regular. The general solution to the equations of motion can thus be expressed 
in terms of 2K independent constants of integration which are fixed by the initial 
conditions. If, on the other hand, the determinant of the Hessian matrix is zero, 
the accelerations — and thus the dynamics — will not be uniquely determined by the 
positions and the velocities. The general solution to the equations of motion will 
then possibly involve arbitrary functions of time. In contradistinction with the 
former case the Lagrangian is said to be singular. 

Henceforth we assume that the Lagrangian L is singular and that the rank of 
its associated Hessian matrix is constant everywhere on the velocity phase space 
and equal to K — R ior R £ N, viz. 

rank -^— ^ = K - R. (3.5) 



dqkdq 



that there is a finite number of discrete degrees of freedom in order to render this summary as 
simple as possible. 
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Therefore, the matrix W has R null-eigenvectors X^^': 

K 

^xf^(g,g)H^fc/(g,g) =0 fori = l,...,i?. (3.6) 

1=1 

Contracting the Euler-Lagrange equations (3.3) with those eigenvectors we obtain 

necessary and sufficient conditions so that equations (3.3), interpreted as algebraic 

equations for the unknown g^, have a solution; these are 

K 

H<i^q)-=Y.^t\<i^^)M<i^^)=^ fori = l,...,i?. (3.7) 

fc=i 

The independent conditions amongst equations (3.7) are called Lagrangian con- 
straints. 

Remark. Some null-eigenvectors can be determined from the generalised Bianchi 
identities, which are obtained through Noether's second theorem; as a direct con- 
sequence, gauge theories are necessarily singular (see, e.g., [Wip94]). 

Hamiltonian lormalism 

Primary constraints. The starting point for the Hamiltonian formalism is to define 
the canonical momenta by 

dL 
Pk-=Tr^ ioik = l,...,K. (3.8) 

OQk 

We see that the vanishing of the determinant of the Hessian matrix W is precisely 
the condition that precludes the expression of the velocities as functions of the co- 
ordinates and momenta. In other words, the momenta (3.8) are not all independent 
in this case, and there exist amongst equations (3.8) some relations 

(l)i{q,p)=0 fori=l,...,R, (3.9) 

which are assumed to be independent. The conditions (3.9) are called primary 
constraints to emphasise that the equations of motion were not used to obtain 
them. Through equations (3.9) these primary constraints define a {2K — R)- 
dimensional submanifold — the primary constraint surface — , which we suppose to 
be smoothly embedded in phase space. 

Generalised Legendre transformation — Canonical and Dirac Hamiltonians. The 
canonical Hamiltonian is introduced by 

K 
H^:=^qkPk-L. (3.10) 

k=l 
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Even though H^ as defined by equation (3.10) is a function of the positions and 
the velocities, its dependence on (g, q) is quite specific. Indeed, it is a remarkable 
property of the Legendre transformation that the velocities enter H^ only through 
the momenta given by definitions (3.8). This follows simply from the evaluation of 
the change 5Hc induced by arbitrary independent variations of the positions and 
velocities. This means that H^. is a function of the p's and g's. However, it is not 
uniquely determined as a function of the phase space variables since the variations 
5pk of the momenta are restricted to preserve the primary constraints (3.9). In 
other words, the canonical Hamiltonian is well defined only on the submanifold 
characterised by the primary constraints, and it can be extended arbitrarily off 
that manifold. The resulting function, which is not unique, is given by 

R 
H^:=H, + Y,uj\q,p)cl),, (3.11) 

and is called a Dirac Hamiltonian. The canonical formalism remains unchanged 
under the replacement He -^ Hxi- 

The passage from g, q, L(q,q) to q, p, Hx){q,p) is called a generalised Legendre 
transformation. It enables to cast the action principle into Hamiltonian form: The 
Hamiltonian equations of motion 

• - ^Ih + \^ i^ 
dpk j-[ dpk' 

dHc sr^ id(t)i 
o<lk ^ oqk 

which are equivalent to the original Euler-Lagrange equations (3.2), can be derived 
from the variational principle 

fti / ^ ^ . \ 

S i^qkPk-Hc-^uj'(pijdt = 0, 

for arbitrary variations 5qk, 6pk, Su^ subject only to the restriction that 5qk vanish 
at the endpoints. The variables w*, which render the Legendre transformation 
invertible, play here the role of Lagrange multipliers that are enforcing the primary 
constraints (3.9). 

Poisson bracket — Weak and strong equations. Introducing the Poisson bracket of 
two arbitrary functions /, g, defined in phase space, by 

1/ ^1 .= f^ K^ _ f^ K^ (3 12) 

u,^1k dpk f-' dpk dqt ' 
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we can formally write any equation of motion in the canonical formalism as 

R 

f = {f,H^} = {f,H,} +Y,^'{f,^^}■ (3.13) 

At this stage it is useful to distinguish between weak and strong equations. The 
primary constraints (3.9) do not vanish identically throughout phase space and, 
in particular, they have nonzero Poisson brackets with the canonical variables. To 
take account of this property we write the primary constraints (3.9) as 

^^{q,p)^0 foii = l,...,R, (3.14) 

where we have introduced the weak equality symbol "~". More generally, two 
functions /, g that coincide on the primary constraint submanifold are said to be 
weakly equal, viz. f ^ g. On the other hand, an equation that holds throughout 
phase space is called strong, and the usual equality symbol "=" is used in that case. 
Accordingly, we may write equations (3.11) and (3.13) respectively as 

R 
H^ = H^ + J2^'{Q,p)(pt-Hc, (3.15a) 

1=1 

and 

R 

f ^ {f,H^} ^ {f,H,} + Y,^'{f,(|>^}■ (3.15b) 

Dirac-Bergmann algorithm — Total Hamiltonian. A basic consistency requirement 
is that the primary constraints be preserved when time evolution is considered. 
This gives rise to the consistency conditions [Dir50, AB51] 

R 
4>i^{<Pi,Hc}+Y,^^{(t>iAA-0 ioii = l,...,R, (3.16) 

i=i 

which, for each value of the index i, correspond to one of the three possibilities: 

! 

1. Equation (3.16) is trivially satisfied, i.e. « 0. 

2. Equation (3.16) is actually an equation to be satisfied by the Lagrange mul- 
tipliers U). 

3. Equation (3.16) reduces to a relation independent of to, that is, a new con- 
straint on the p's and the g's which is said to be secondary — in contrast with 
"primary" — to emphasise that the equations of motion were used to obtain it. 
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For any secondary constraint we must again impose a consistency condition similar 
to (3.16) and perform the above analysis. This process must be repeated until all 
consistency equations have been exhausted. The outcome of this algorithm then 
consists in: 

1. a complete set of R primary and S secondary constraints, 

0„«O foim = l,...,R + S = M; (3.17) 

2. restrictions on the Lagrange multipliers co, 

R 

{^m,H,} +Y,^'{<PmAi} -0 form = l,...,M. (3.18) 

1=1 

Conditions (3.18) may be thought of as a set of M nonhomogeneous linear equations 
in the R unknowns w*. (Note that since R ^ M some compatibility conditions — 
i.e. further secondary constraints — could arise to ensure consistency; they must be 
treated through the algorithm as well.) The general solution of equations (3.18) is 
of the form 



w « /i 



+ X^AX, (3.19) 



a=l 



where //* is a special solution of equation (3.18) and i/^ is a complete set of A 
linearly independent solutions of the associated homogeneous system 

R 

Y,^a{(Pm,(pi}-0. (3.20) 

j=l 

Since the coefficients A° are totally arbitrary, equation (3.19) means that the mul- 
tipliers to have been resolved into one arbitrary part and one part that is fixed by 
the consistency conditions (3.18); for more details, see [HT92]. 

As a direct consequence of this analysis we may write the equation of motion 
(3.15b) equivalently as 

f-{f,HT}, (3.21) 

with the total Hamiltonian, 

A 
H^ = H'^ + Y,r4>c., (3.22) 

a=l 
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which is defined as a sum of a Dirac Hamiltonian -ff^, 

R 

H'^ = Hc + Y,f^'(t>i, (3.23a) 

i=l 

and a specific Hnear combination of the primary constraints (pa, 

R 



Y.yl<P^. (3.23b) 



Therefore the general solution to the canonical equations (3.21) will involve A 
arbitrary functions of time, as expected for a singular system. 

First-class and second-class constraints. The concept of first-class and second- 
class functions on phase space was introduced by Dirac [DirSO, Dir64]; it plays a 
central role in the Hamiltonian formalism. 

A function f{q,p) is said to be first class if its Poisson brackets with every 
constraint vanish weakly, that is 

{f,<Pm}-0, ym = l,...,R. (3.24) 

The set of first-class functions is closed under the Poisson bracket [Dir58a]. Directly 
available examples of first-class functions are H'rj^ and (/>«, defined by (3.23a) and 
(3.23b) respectively. Hence the total Hamiltonian (3.22) is the sum of the first- 
class Dirac Hamiltonian i7^ and an arbitrary linear combination of the primary 
first-class constraints (pa- The number of arbitrary functions A" is thus equal to 
the number of primary first-class constraints. What makes first-class constraints 
so important is that they generate gauge transformations.^ It indicates there is 
more than one set of canonical variables that corresponds to a given physical state. 
To overcome this ambiguity further restrictions may be imposed on the canonical 
variables: one proceeds to a gauge-fixing procedure (see, e.g., [Bur97]), which is not 
unique; its specific implementations actually determine the corresponding methods 
of canonical quantisation — e.g., Dirac quantisation, BRST method, reduced phase 
space quantisation [Bur82].^ 

Conversely, if there exists at least one constraint such that its Poisson bracket 
with / does not vanish weakly, then / is said to be second class. If X/3 denotes a 



^For a more thorough discussion on the exact meaning of first-class constraints, especially 
with regard to some controversy found in the literature on the so-called Dirac's conjecture, see 
[Gov91, HT92]. 

^In the BRST approach, the phase space is extended rather than reduced. 
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complete set of second-class constraints, then the matrix C, the elements of which 
are precisely the Poisson brackets between the x's, 

C/3/3' = {X/3,X/3'}, (3.25) 

is nonsingular and its inverse is denoted as Cr^, = C^^ . (This property may also 
serve as an alternative definition of the second-class constraints [Bur97].) 

Dirac bracket. Second-class constraints are associated with redundant degrees of 
freedom which could be solved in terms of the other degrees of freedom provided 
that a generalised Poisson bracket referring to the remaining degrees of freedom 
only would be defined. This is achieved with the Dirac bracket, which is defined 
by 

{f,9}^ ■■= {/,<?} - E{/'^/3} C"""' {XP',9}, (3.26) 

for any arbitrary functions f,gon phase space, and which is always consistent 
with the second-class constraints. 

Owing to definition (3.26), second-class constraints become strong equations, 
and we may write the equation of motion (3.21) as 

f^{f,H'}^, (3.27) 

where H* stands for any first-class Hamiltonian that generates time evolution: 
for instance, the total Hamiltonian Hq^, or a more general choice with the extended 
Hamiltonian He which includes, in the linear combination of first-class constraints, 
secondary first-class constraints as well (see [HT92, Gov91] and footnote (2) on 
page 58). 

Crenerally covariant systems 

Systems that are invariant under arbitrary coordinate transformations are analo- 
gous to the parameterised form of mechanics in which the Hamiltonian and the 
time variable are introduced as a conjugate pair of canonical variables correspond- 
ing to a new degree of freedom. The resulting theory is invariant under an arbitrary 
reparameterisation. In field theory one can also introduce arbitrary labels for the 
spatial coordinates: The theory becomes invariant under an arbitrary change of the 
space-time coordinates, just like generally covariant theories. This is the reason 
why theories of gravity are said to be already parameterised systems. The 3-1-1- 
splitting of space-time, which is a crucial step towards their canonical formulation 
(see Section 3.3), has the virtue of "de-parameterising" the theory. 
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Time as canonical variable. Consider the action functional (3.1) and assume for 
simplicity that the Lagrangian L be regular. In order to raise the physical time 
t as a canonical variable we parameterise the theory with respect to a new time 
parameter r such that t = t(r) and dt/dr ^ 0. The original time is then interpreted 
as an additional generalised coordinate, i.e. qo := t. The action functional (3.1) 
thus becomes 



/•n „/ i-Ti 

S= At % L{qk, -j) = I dTLr{qm,q'm) for m = 0, . . . ,K, 

Jrn % Jrn 



(3.28) 



where a prime denotes differentiation with respect to the parameter r. It is invari- 
ant under any further reparameterisation r* = t*{t). The conjugate momenta are 
given by 

pV = ^ = Tr^=Pk foTk = l,...,K, (3.29a) 

dql dqk 

p^ = ^=L-Y,--qk = -H{qk,Pk). 3.29b 

Hence relation (3.29b) is a primary constraint, 

H^iq.p) ■■= Pt^ + H{qk,Pk) ~ 0, (3.30) 

and the canonical Hamiltonian 

K 



Hi^^ = E 9-P- -Lr = q'o Ml^P) (3-31) 



m=0 

identically vanishes — a striking feature due to the parameterisation invariance. 
Since Hq f» is the only constraint, the total Hamiltonian is simply 

H^^^ =N{T)Ho{q,p), (3.32) 

where N{t) is an arbitrary function which is interpreted as a Lagrange multiplier 
in the variational principle 

K 

^ r d^ ( E '?« P™ - ^(^)^o {q,p))=0. (3.33) 

J TO Vm=0 ^ 

(t) 

In particular, variation of (3.33) with respect to the momentum p^ yields N{t) = 
dt/dr; hence the original canonical theory is recovered if we choose the synchronous 
temporal gauge, i.e. A^ = 1. 
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Generalisation to £eld theory. The extension of the above parameterisation pro- 
cedure to a field theory that is described by a Lagrangian density 2[cl),d^(j)j is 
achieved by interpreting the space-time coordinates as four new field variables, i.e. 
x^ = x^{Xj^). Thus there are four extra conjugate momenta; four constraint equa- 
tions H^ K, are required to relate these momenta to the Hamiltonian density and 
the field momentum density. It is convenient to consider a foliation in terms of hy- 
persurfaces which are labelled by constant 'times', that is op = const. Let n" be the 
unit normal vector field to these hyper surf aces. Then we decompose the original 
set of constraints H^ into their parts normal and tangential to the hyper surf aces, 
namely H± and Hi for i = 1,2,3 respectively. The ensuing total Hamiltonian is 

3 

Ht = NH± + Y^ N'Hi. (3.34) 

i=l 
In the context of general relativity the primary constraints H± and Hi are called the 
super- Hamiltonian and the super-momentum respectively; see [MTW73, page 521]. 
The four corresponding Lagrange multipliers N^ specify how to move forward 
in time from one hypersurface to the other as well as onto one and the same 
hypersurface — they are called the lapse and shift functions respectively; see, e.g., 
Kuchaf's review for a very detailed account [Kuc81]. 

3.2 Theories with higher derivatives 

3.2.1 Introduction 

The Hamiltonian formulation of theories with higher derivatives was firstly devel- 
oped by Ostrogradsky almost one and a half century ago [OstSO]. Basically, the 
Ostrogradsky method [DNF82, Whi37] is adapted only to systems described by a 
regular Lagrangian, that is, a Lagrange function the associated Hessian matrix of 
which with respect to the highest-order time derivatives has a nonzero determi- 
nant. The underlying idea of this method — and of its subsequent generalisations — 
consists in introducing besides the original configuration variables a new set of 
coordinates that encompasses each of the successive time derivatives of the original 
Lagrangian coordinates so that the initial higher-order regular system be reduced 
to a first-order system. In order to recover the standard interpretation of the old 
coordinates when time evolution is considered it is essential to insert into the for- 
malism the definition of the new coordinates in terms of the old ones by means of 
a Lagrange-multiplier technique. As a direct consequence, the auxiliary degrees of 
freedom that enable the order-lowering in the initial Lagrangian are constrained: 
One must resort to Dirac's approach for building up a consistent Hamiltonian for- 
mahsm [GR94]. 
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It seems that the very first use of Ostrogradsky's method was undertaken by 
Kerner in an attempt to develop a Hamiltonian formalism for Wheeler-Feynman 
electrodynamics (cited in [JLM86]). It is a remarkable result indeed in relativistic 
particle dynamics that, for conservatively interacting particles, there does not ex- 
ist an ordinary single-time Lagrangian or Hamiltonian description if the position 
coordinates belong to a Lorentz frame. Rather, it turns out that in a local-in- 
time representation all higher time derivatives must appear in the Lagrangian, 
which is therefore an infinite-order Lagrange function. Now, because infinite-order 
Lagrangian systems exhibit cumbersome mathematical features, there have been 
many attempts to weed out all higher time derivatives occurring in the higher-order 
equations of motion by utilising the equations of motion of lower orders. In such a 
reduction process the question naturally arises whether the original infinite-order 
Lagrangian can also be reduced to an ordinary Lagrangian. The answer is not 
trivial for, in general, a system of second-order ordinary differential equations does 
not admit an ordinary Lagrangian description. On the contrary, the existence of 
a Hamiltonian formulation is warranted owing to the Lie-K6nig theorem [Whi37]. 
Moreover, as Jaen et al. pointed out, a straight substitution into the equations of 
motion implies the intromission of some constraints that modify the very nature of 
the variational principle underlying the Lagrangian formalism. It is precisely for 
this reason that Jaen et al. tackled the problem of finding the reduced Hamiltonian 
by means of the techniques of constrained Hamiltonian dynamics together with the 
Ostrogradsky method in the case of Wheeler-Feynman electrodynamics [JLM86] 
(see also Ellis's canonical formalism for a second-order Lagrangian [E1175]). 

More generally, in the problem of reduction of higher-order Lagrangians de- 
scribing the dynamics of systems of point particles, which are given as formal 
power series in some ordering parameter, Damour and Schafer developed a new 
method, called 'the method of redefinition of position variables', that enabled them 
to eliminate consistently the higher time derivatives, directly at the Lagrangian 
level [DS91]. 

This type of reduction process was also examined by Grosse-Knetter in the con- 
text of general effective higher-order Lagrangians [Gro93, Gro94]. However, it must 
be objected that Grosse-Knetter's treatment is not reliable. The major criticism 
that one could raise has to do with the simplifying assumption that 5^(0)-terms 
occurring in the path-integral formalism may be neglected on the mere analogy 
of what happens in the first-order case. As a matter of fact these terms should 
not be discarded unless a suitable regularisation method would be successfully ap- 
plied. Beyond that particular point, the proofs — quite elusive, by the way — make 
an inappropriate use of the results obtained by Damour and Schafer, which are 
valid only for effective Lagrangians that can be written as formal power series. 
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Furthermore no scrupulous constraint analysis is performed when discussing the 
equivalence of Lagrangian and Hamiltonian path-integral quantisations. In that 
respect the claim that any higher-order effective Lagrangian can be reduced to a 
first-order one without introducing extra degrees of freedom — by the way, "elim- 
inating the unphysical effects associated with Ostrogradsky additional degrees of 
freedom" [Gro94] — is not proved by any rigorous analysis whatsoever. 

On the other hand, Gitman et al. have worked out a consistent method of build- 
ing up a Hamiltonian formalism for any constrained system with higher derivatives 
(see [GT90] and references therein). Buchbinder and Lyakhovich improved the 
treatment in a form that is more appropriate for theories of gravity (see [BOS92] 
and references therein); more specifically, they applied the method to the most gen- 
eral quadratic gravitational action in four dimensions [BL87]; then, with Karataeva, 
they extended the analysis to the reahn of multidimensional quadratic gravity 
[BKL91]. Within the particular subclass of singular second-order Lagrangians 
Nesterenko [Nes89] and Batlle et al. [BGPR88] discussed, in a more geometrical set- 
ting, the relationship between the Lagrangian and Hamiltonian frameworks; Galva 
and Lemos examined the peculiar case of singular second-order Lagrangians that 
differ from a regular first-order Lagrangian by a total time derivative of a function 
of both the coordinates and velocities [GL88]; prompted by the strange superstition 
that dealing with Ostrogradsky Lagrangian constraints ravels the mind, Schmidt 
contrived an 'alternate Hamiltonian formalism' [Sch97] (see also Kasper's similar 
approach [Kas97]). 

In the general case Pons achieved a rigorous unification of the Dirac formalism 
for constrained systems and the Ostrogradsky method for higher-order Lagrangians 
[Pon89]. Saito et al. developed a similar formalism that they applied to the afore- 
mentioned case of a Lagrangian describing the gravitational interaction of two point 
particles [SSOK89]. 

Subsequent formal investigations inspired by the Ostrogradsky method may be 
reviewed briefly. In order to study the interplay of higher-order Lagrangian and 
Hamiltonian formalisms, Gracia et al. introduced 'partial Legendre-Ostrogradsky 
transformations' and constructed some interesting geometric structures [GPR91]; 
within this geometrisation scheme Gracia and Pons extended previous significant 
results [GP92] to the study of Noether-symmetry transformations for higher-order 
Lagrangians [GP95]. Chitaya et al. have analysed the constrained Ostrogradsky 
method for constructing the generators of local symmetry transformations [CGS97]. 
Kaminaga has showed that the adjunction of total derivative terms to a higher- 
order Lagrangian does not affect either the classical or the quantum canonical 
structure of the system [Kam96].^ Nakamura and Hamamoto have analysed the 

■^In the Lagrangian formalism this problem is trivial: The equations of motion are not affected 
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path-integral formalisms associated with different variants of the Ostrogradsky con- 
struction: the standard, the constrained, and the generalised methods respectively 
[NH96]. Most recently, Nirov has elaborated a BRST formalism for systems that are 
invariant under gauge transformations with higher-order time derivatives of gauge 
parameters [Nir96, Nir97]; Pimentel and Teixeira have generalised the Hamilton- 
Jacobi approach for higher-order singular systems [PT98]. 

Very few applications of the Ostrogradsky method to specific higher-order field 
theories have been actually undertaken; for instance, de Urries and Julve have 
demonstrated the physical equivalence of relativistic scalar field theories with higher 
derivatives and their reduced second-order counterpart: They compare the stan- 
dard Ostrogradsky procedure to a Lorentz-invariant method based on the use of the 
so-called 'Helmholtz Lagrangian' [dUJ98]. In the context of higher-order theories of 
gravity, especially in quantum cosmology, the situation is not very different. For a 
quadratic Lagrangian involving R and R^ terms Kasper has compared Buchbinder 
and Lyakhovich's generalised Ostrogradsky formalism to another method that is 
characterised by the introduction of a scalar field at the Lagrangian level; he ob- 
tains approximate solutions to the Wheeler-DeWitt equation, with a closed FLRW 
ansatz [Kas93] (Kasper's analysis has been improved by Pimentel and Obregon 
who solved the same equation analytically [P094]). Pimentel et al. have obtained 
solutions to the Wheeler-DeWitt equation corresponding to the Taub model in the 
case of a pure quadratic R^ Lagrangian [POR97]. In a very recent work, Ezawa et 
al. have developed a canonical formalism for f{R) theories of gravity [EKK"'"98]; 
the formulation presented in Section 3.3 is in total agreement with their approach 
although a slightly different choice for the canonical variables does actually simplify 
the analysis.^ 

Remark. The Ostrogradsky method has also been used to highlight the fundamen- 
tal drawbacks inherent in higher-order theories [EW89, Sim90, Sch94, Sch97]. 

3.2.2 Ostrogradsky's method for regular systems 

Consider a system with a finite number K of discrete degrees of freedom rep- 
resented by generalised coordinates Xk ior k = 1, . . . ,K in configuration space; 
for the sake of clarity we assume that these coordinates are commuting variables 
though it should be clear that exactly the same considerations are valid for com- 
muting and anticommuting degrees of freedom, as well as for infinite discrete or 
noncountable set of coordinates — the former case corresponds to the treatment of 



by the addition of a total derivative term to the Lagrangian. 

'The completion of this work was achieved during a visit at the Department of Mathematics 
of the University of the Aegean (Samos, Greece) in May 1996. 
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bosonic and fermionic types of degrees of freedom and the latter typically that of 
field theories. Consider now that the system under study be described by some 
time-independent — it is for convenience only that we restrict the analysis to time- 
independent Lagrangians; time-dependent Lagrange functions can also be treated 
along the same lines as developed hereafter — regular Lagrange function 

L(xk,Xk,Xk,---,xi''"') for ak^ I andk = 1,...,K, (3.35) 

where we adopt the convention (^) Xk = x^ . The index a^ denotes the maximal 
order of all time derivatives of the coordinate x^ appearing explicitly in the La- 
grangian (3.35). (Note that when a^ = 1 for all degrees of freedom one recovers as 
a special case of the present general treatment the familiar situation encountered in 
classical mechanics for first-order Lagrangians.) In the original formulation of the 
Ostrogradsky method it is also assumed that the Lagrange function does depend 
on at least the first-order time derivative of each degree of freedom in order to 
prevent the occurrence of undesirable constraints. 

As usual we assume that the dynamical time evolution of the system is obtained 
upon extremising the action functional constructed from the Lagrangian (3.35) 
under variations 5xk{t) of the configuration variables Xk- The variational principle 
yields the 2A''*'^-order Euler-Lagrange equations 



^ ^ d y'' dL 



\ ^ / 1 \ o"fc 



<Tfc=0 ^ ^ OXf. 

Adopting Ostrogradsky's prescriptions we introduce new quantities Pk^^y^ ^^^^ 
7fe = 0, . . . , (ofc — 1) (which depend on derivatives of the coordinates x^ up to order 
2aA; — 7fc — 1) that are defined by the following recursion relations 

^fc.°^-i = T7^' (3.37a) 

Pfc,/3fc-i = — 7a-T-T7Pfc,/3fc for/3fc = l,...,(afc- 1). (3.37b) 

dx^: ' '^'^ 

We can now prove the following result (see, e.g., [Whi37, pp. 265]). 

Ostrogradsky's theorem. Let L he a regular Lagrangian depending on gener- 
alised coordinates Xk and their time derivatives up to order ak ^ I for k = 1, . . . ,K. 
Consider new quantities Pk,^k ^^^ 7fc = 0, . . . , (a^ — 1) that are introduced by the 
recursion relations (3.37). The Euler-Lagrange equations (3.36) derived from L are 
equivalent to the system of canonical equations that is obtained from the Hamil- 
tonian 

K afc-1 
fc=l 7fc=0 
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Prooi. Owing to the recursion relations (3.37) the Euler-Lagrange equations (3.36) 
take the simple form 

dL d , , , 

:Pk,o = for fe = 1, . . . , Ofc, (3.38) 



dxk dt 

which is reminiscent of Hamilton's equations of motion. To make this statement 
more specific consider the quantity defined by 

K ak-1 

^ = EE4"=^P'^-7.-i, (3-39) 

k=l 7fc=0 

where the variables Pk,-yk sue determined through the recursion relations (3.37). 
As it stands, the function H should depend explicitly on variables Xk and their 
derivatives up to order (2afe — 1), on account of the definitions (3.37) for the quan- 
tities Pk,-yk the dependence of which on the velocities x^'^ for cjfc = 1, . . . , {2ak — 1) 
is manifest. However, taking the differential of the function H defined above we 
obtain the identity 

d^ = E E (4"'=^'^dp,,,, -p,,,,dx^)) - E(^ -^^.ojdx,, (3.40) 

where the last sum turns out to be a combination of the Euler-Lagrange equations. 
This result establishes that the function H defined by (3.39) depends on variables 
xj^'' for (Tfc = 0, . . . , {2ak — 1) only through a dependence of the variables xj^'' and 
Pfc,7fc themselves, viz. H = H[xj^'' ,pk^'y^.) ■ Note that this property holds even in 
the case of a singular Lagrangian; however, here, unhampered by any degeneracies 
we are able to perform a suitable Legendre transformation: We regard variables 
{xjj'' ,Pk,'y^,) as being canonically conjugate pairs and we only solve for the first- 
order time derivatives of the variables x^'' in terms of variables Pk,ak-i and 
variables xj^'' ; throughout this procedure the latter variables must be considered 
as independent of one another rather than being time derivatives of order jk of 
the coordinates Xk- Finally the identity (3.40) shows that the Euler-Lagrange 
equations (3.36) are equivalent to the system of canonical equations 

^t^ = 4^^ (3-41a) 

dH 

Pfc,7. = -^> (3.41b) 

obtained from the Hamiltonian (3.39). I 
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Let us summarise the whole Ostrogradsky procedure. Firstly, we introduce the 

conjugate momenta pk^a^-i with definition (3.37a), where it should be understood 

that the variables [xk,Xk, ■ ■ ■ ■,x^'' ) are independent. Owing to the fact that 

our Lagrangian is regular, we are able to invert relations (3.37a) for the velocities 
•("fe-i) 



VIZ. 



The remaining conjugate momenta pk,/3,.-i for /3fc = 1, . . . , (a^ — 1) are then de- 
termined by the recursion relations (3.37b); it should be stressed, however, that 
one should not solve these recursion relations for the variables i;^ * in terms 
of the variables (x^, x^, . . . , x^ ''^ '), the conjugate momenta {pk^^ ■ ■ ■ ■,Pk,ak-2) , 
and time derivatives of the latter, since the variables (^Xk,Xk, ■ ■ ■ ^^k'' ) ^^^ ^^~ 
garded as independent in the Ostrogradsky method. 

Secondly, the canonical Hamiltonian is defined by equation (3.39), where the veloc- 
ities x^"* have been substituted in accordance with equation (3.42); explicitly 
it is 

K 

fc=l 



tt( (7fc) \ _ V^ -("fc-l)/ • (Ofc-l) \ 

X'^k ^Pk,"/kj — / _, -^fc I ^^fcj 2;^, . . . , X^ jPk,ak — l J Pk,ak — 1 

ij I Xfc, Xfc, . . . , X^ , X^ y. . . J j 



+ E£4'''=^PfcA-i- (3.43) 

fc=i&=i 

Eventually it yields Hamilton's equations (3.41), which can be written down — upon 
the introduction of the fundamental Poisson brackets on the phase space spanned by 
the pairs of canonically conjugate variables {xjj'' ,Pk,-yk) — as the canonical system 



X 



(Tfc) _ /^(7fc) 
k 



{xl^^H}, (3.44a) 

Pk,fk = {Pk,^k,H}. (3.44b) 

After the Poisson brackets have been computed, it is necessary to impose the 
condition that the variables x^^*" be time derivatives of order 7^ of the original 
coordinates Xk(t). This requirement may be implemented ab initio by firstly asso- 
ciating with each variable x^^*" an auxiliary independent degree of freedom; then 
replacing the original Lagrangian (3.35) by an extended Lagrange function, wherein 
the definitions of the new degrees of freedom in terms of the old ones are inserted as 
constraints with Lagrange multipliers. It is actually the modern way of building up 
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an Ostrogradsky formulation and its advantages are twofold. Firstly, it shifts the 
study of higher-order Lagrangians to the analysis of the usual type of dynamical 
systems for which powerful techniques have been developed so far (see, for instance 
[Gov91]); it thus renders the necessity of a separate discussion of the quantisation of 
higher-order systems void of any justification whatsoever. Moreover the extension 
of the formalism to embody the case of singular higher-order systems may be done 
in a natural way along the lines of Dirac's approach (cf. Subsection 3.2.3). Sec- 
ondly, canonical or path-integral quantisations of such higher-order systems may be 
achieved unhindered by the possible ambiguity that could arise when dealing with 
the variables x^"" and their first-order time derivatives xj^'' ■ Without introducing 
auxiliary degrees of freedom it could indeed be confusing to perform a Legendre 
transformation on the variables x^'' only, while leaving the other derivatives 
untouched. 

We illustrate the Ostrogradsky construction with a simple example. 

Example 3.2.1. We take the Lagrangian L = x^/2 — u;^x^/2 — e^x^/2 corre- 
sponding to a simple (unit-mass) harmonic oscillator with an acceleration-squared 
term. We firstly consider x and y := x as independent variables; the Lagrangian 
then becomes L = y'^/2 — to'^x'^ /2 — e^y^/2- We define Ostrogradsky momenta as 
Py := dL/dy = —e^y and px ■= dL/dy — Py = y + e^y respectively. Only the 
former relation is inverted in terms of ij; we obtain the Hamiltonian of the system, 
H{x,px,y,Py) = ypx — P^/2e^ — y^/2-|-a;^x^/2, from which the canonical equations 
are readily derived: They are x = y, y = —py/e'^, px = —to'^x, and py = —px + y 
respectively. We recognise easily the definitions of y and momenta. Finally we re- 
cover the Euler-Lagrange equations from the canonical equations for the momenta, 
i.e. e^x^"') -I- X -I- lo'^x = 0. 

Let us see now how the original Ostrogradsky construction can be treated within 
the framework of Dirac's Hamiltonian formalism for constrained systems. A sep- 
arate analysis of regular and singular higher-order systems is at this stage unnec- 
essary for, in both cases, the higher-order Lagrangian is reduced to a first-order 
Lagrangian exhibiting primary constraints. 

3.2.3 Constrained Ostrogradsky construction 

Lagrangian lormalism 

Consider a system with K degrees of freedom x^ for A; = 1, . . . , K and assume that 
the associated Lagrangian (with higher derivatives) be given by expression (3.35). 
Let us introduce new independent variables qk,'yk for 7^ = 0, . . . , (a^ — 1) through 
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the following recursion relations 

Qk,/3k = <ikA~i for /3fc = l,...,(afc - 1), (3.45a) 

Qkfl = Xk- (3.45b) 

Clearly this choice corresponds to assuming that the successive time derivatives 
x^'' are specified as independent variables, for relations (3.45) imply 

^Kik=x^k''^ for7fc = 0,...,(afc-l). (3.46) 

Preservation of the standard interpretation of the old coordinates when time evolu- 
tion is considered requires the relations (3.45a) to be brought into the Lagrangian 
formalism as primary constraints. The initial higher-order Lagrange function L is 
replaced by an extended first-order Lagrangian L (with Lagrange multipliers \k,i3k 
for /3fc = 1, . . . , {ak — 1) as additional variables) that is given by 

K Ofe-l 

L{<lkakAk,'yki^k,(3k) '■= L{qk,^kAk,ak~l) +2^1^ i^kA " 9fe,/3fe-l) Afe./Jfe- (3.47) 

fc=l/3fe=l 

The variables (9fc,7fc) -^fc,/?^) thus encompass the independent degrees of freedom 
of the extended Lagrangian system, with {qk,Pu->'^k,Pk) being auxiliary degrees of 
freedom as compared to the original coordinates Xfc(=: qkfi)- 

Before resorting to Dirac's analysis we must ensure that both Lagrange func- 
tions L and L yield equivalent equations of motion [GR94] . 

Proposition 3.2.1. Let L he the original Lagrangian (3.35). Let L he the asso- 
ciated extended Lagrangian (3.47). The Euler-Lagrange equations corresponding 
to L and L respectively are equivalent. 

Prooi. The standard variational principle calls for the action functional constructed 
from the Lagrangian (3.47) to be stationary under variations of the independent 
degrees of freedom. With respect to the auxiliary degrees of freedom, it yields the 
equations 

f) T A 

[<5qi,/3j — > -^ ^ TT^fc.pfe+i + -^fc,pfc = for pfc = l,...,(afc - 2), (3.48a) 

(yQk,pk clt 

dL d / dl 

dqk,ak-i 'it\dqk,ak-i. 

[SK^^] — > qkA - 9fc,/3fc-i = 0, (3.48c) 



[<5%,c,-i] — > -^ -jj { Tj- ) + Afc,afe-i = 0, (3.48b) 



whereas the variation with respect to the original coordinates gives 

Or 1 

[5q.,o] — > -t: + — Afc,i = 0. (3.48d) 

oqho at 
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These last equations (3.48d) are the actual equations of motion of the system: The 
former set of equations (3.48a)-(3.48c) contains in fact constraint equations that 
determine both the auxiliary degrees of freedom qk,/3k ^s the successive time deriva- 
tives of the original coordinates qkfl (equation (3.48c)) and the Lagrange multipliers 
Xk,/3k ill terms of successive partial derivatives of the original Lagrangian L (equa- 
tions (3.48a) and (3.48b)); we may inject the ensuing multipliers into equations 
(3.48d) to obtain the equations of motion 






which in turn reduce to the Euler-Lagrange equations (3.36) as soon as we enforce 
the constraints (3.48c). ■ 

Remark. Hitherto the procedure has been completely general, that is, irrespective 
of the regular or singular nature of the original Lagrangian: henceforth we focus 
the analysis on singular Lagrangians. 

Hamiltonian lormalism 

We now assume that the original higher-order Lagrangian L be singular — though 
the present analysis applies equally well to regular systems — and that the rank of 
its associated Hessian matrix be constant everywhere and equal to K — R (R £N), 
viz. 

rank ^ ^ ^^ = rank — =^ ] = K - R. (3.50) 

V^xl^'^^^xf^'V \dqk,a,~idqi,a,-i J 

With the aim of developing a Hamiltonian formulation on the basis of the La- 
grangian L we proceed to the standard Dirac analysis. 

The configuration space of the extended system is spanned by the set of vari- 
ables (gfc,7fe , Aa;,/3 J for 7fc = 0, . . . , (afc - 1) and /3fc = 1, • • • , (ofc - 1). We define the 
momenta canonically conjugate to the independent degrees of freedom as 

dL , , , 

PkA~i ■= J^ = -AfcA> (3.51a) 

dL dL , , 

PMfc-i := J^ = ^ ' (3-51b) 

Oqk,ak-l Oqk,ak-l 

dL 

VTfcA ■■= TT^ = 0, (3.51c) 
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where we have used equation (3.47). We thus see from equations (3.51) that the 
phase-space degrees of freedom [qk,'ykiPk,"fk'^^k,Pui^k,p,,) are not all independent; 
hence we may identify the following set of primary constraints, 

</'fc,/3fc = Pfc,/3fc-i + \,(5k ~ 0, (3.52a) 

vrfc,/3, « 0, (3.52b) 

to which we add the constraints stemming from the singular character of the original 
Lagrangian (cf. equation (3.51b)), 

</'j(9fe,7fc>Pfc,afc-i) ~0 for i = l,...,i?. (3.52c) 

The first set of primary constraints (3.52a)-(3.52b) originates in the specific way 
the auxiliary degrees of freedom have been introduced into the extended Lagrange 
function L (cf. equation (3.47)); given the fundamental Poisson brackets on the 
phase space, 

{9fc,7fc,g«,7j = = {pkak^Pirn]^ {qkrik^Pi^-^i] = ^ki^^kii^ 

{Afc,A,Ai,/3j = = {TTk,f3k^T^UPi]^ {Afc,A,Vr,^^J = 6kl5f3^f3^, 

these primary constraints satisfy 

{VkA^^iA} = ^ki^^kfBi (3.53) 

and hence are second class (they will be removed from the formalism later on). 

Before pursuing Dirac's analysis let us show how it is possible to recover Os- 
trogradsky's prescriptions for the definition of the momenta Pk,yk [Pon89]. 

Proposition 3.2.2. Let L be the extended Lagrangian associated with the orig- 
inal singular Lagrangian L. The Euler-Lagrange equations obtained from L with 
respect to the auxiliary coordinates qk,i3k (^or f3k = 1, . . . , a^ — Ij are equivalent 
to the recursion relations (3.37) defining the Ostrogradsky momenta Pk,^^ (for 
7fc = 0, ... ,afc - Ij. 

Prooi. It is sufficient to examine the Euler-Lagrange equations obtained from the 
Lagrangian L with respect to the auxiliary coordinates qk,i3kj namely 

7r^-i;(7F^)=^ for/3fe = l,...,(afc-l). (3.54) 

Taking definitions (3.47) and constraints (3.52a) into account equations (3.54) are 
equivalent to 

PkA~i = ^ -JiPkA for /3fc = 1, . . . , Ofc - 1, (3.55) 
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which, owing to the definitions (3.45a) of the auxiliary degrees of freedom, coin- 
cide with the recursion relations (3.37b). On the other hand expressions (3.51b) 
correspond to definition (3.37a). Hence, as announced, we recover the definitions 
(3.37) of the Ostrogradsky momenta. ■ 

Remark. The remaining Euler-Lagrange equations 

dL A f dL 



^ A,\r^- I ^ (3.56) 

oqk,o dt\dqk,oJ 

are equivalent to the standard Euler-Lagrange equations for the Lagrangian L (cf. 
equation (3.48d) and the subsequent discussion). 

Of particular concern is the prevailing role played by the momenta pk^a^-i'- 
They are indeed the only variables not involved in the primary constraints (3.52a), 
(3.52b), and their conjugate coordinates qk^a^-i ^^e the only auxiliary variables 
the velocities of which do appear in the original Lagrangian L (compare equations 
(3.51a) with equations (3.51b)). These remarks about the special role played by the 
pairs of conjugate variables {qk,ak-iTPk,ak-i) motivate the definition of a restricted 
canonical Hamiltonian that gives the energy corresponding to the Lagrangian L 
when the variables q^^p^-i for /3fc = 1, . . . , (a^ — 1) have been frozen; it is 

K 

Hr{qk,^k'Pk,ak-l) '■= yZ'ik,ak~lPk,ak-l " -^(gfc,7fc , gfc,afc-l) (3.57) 

fc=l 

and dwells in the restricted phase space — irrespective of whether the relations 
(3.51b) are invertible or not [HRT76, Sun82]. 

We define the canonical Hamiltonian of the system in accordance with the usual 
prescription: 

X ctfc-l K ak~l 

^c = ^ ^ qkrikPKik +X] X] -^fcA^fcA -^(9fc,7fe'^fc,7fe>-^fc,/3j- (3.58) 

fc=l7fc=0 fc=lA=l 

Making use of the constraint equations (3.52) and definition (3.57) of the restricted 
Hamiltonian we may write the canonical Hamiltonian (3.58) as 

K Qfc-l 

He {qkau^Pknu) = Hr{qkau^Pk,Ck-i) + XI IZ Pk,Pu-i 1k,Pu- (3-59) 

k=l /3fc=l 

We now proceed to the Dirac analysis of the system the canonical Hamiltonian 
of which is given by (3.58). Firstly, we write down the Dirac Hamiltonian of the 
system 

K afc-l R 

H^:=H, + Y.Y. ('"''^' ^^'A + ^^'^^ ^Kh) + Y, uJ'(t>^, (3-60) 

fe = l /3fe = l 4=1 
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with new Lagrange multipliers ^^'t^k^yk^Pk ^ ^nd w* associated respectively to the 
primary constraints (3.52) of the extended system. However, as indicated above, 
the first two constraints (3.52a) and (3.52b) are second class: They can be re- 
moved provided the canonical Poisson bracket is replaced by the appropriate Dirac 
bracket. Further we can easily check that the consistency algorithm does not gen- 
erate secondary constraints from these primary ones: Their Dirac bracket with the 
Dirac Hamiltonian (3.60) must vanish on the constraint surface; this requirement 
yields a unique determination of the multipliers ^'^'^'= and v^^l^i' . We thus solve 
these second-class constraints, i.e. \k,p^, = —Pkfi^-i and ^Tk,p^, = 0; hence the Dirac 
Hamiltonian (3.60) simplifies to 

R 
H = Hc + ^uj^(t)i. (3.61) 

i=l 

The local symplectic structure on the phase space is specified through the funda- 
mental canonical Dirac brackets 

{9fc,7fc,?i,7Jx> = = {Pkrik^Plai]'!)^ {9fc,7fc>P«,7i}x. = ^kl^-ikll^ (3-62) 

and time evolution results from the knowledge of this symplectic structure and the 
explicit form of the Hamiltonian (3.61). 

Henceforth we have all the prerequisites at our disposal to generalise Ostrograd- 
sky's theorem for singular Lagrangians [Pon89]; we proceed gradually, establishing 
partial results which will be collected eventually. 

Proposition 3.2.3. Let H he the Dirac Hamiltonian (3.61) associated with the 
singular system. The Hamilton-Dirac equations with respect to the variables 
Qk,i3k-i (for Pk = 1) • • • ) CKfc~lj s^e equivalent to the Lagrangian constraints (3.45a). 

Prooi. We readily obtain the equations 

QkA-i = {QkA^i,H}^ = — = gfcA' (3-63) 

which, obviously, are equivalent to the Lagrangian constraints (3.45a). I 

Proposition 3.2.4. Let H be the Dirac Hamiltonian (3.61) associated with the 
singular system. The Hamilton-Dirac equations with respect to the variables 
Qk,a^-i are equivalent to the dehnition of the momenta pk^a,,-!- 
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rrooi. We expand the equations of motion for the variables g^.a^-i, viz. 

R 



R 



OH, , ,-^ 



9Pk,ak-l ■^ dpk,ak 



(3.64) 



These correspond to the first half of Hamilton's equations for the Lagrangian L, 
where the variables qk,/3,.-i have been frozen. Since the functions H,. and 4>i depend 
solely on the momenta pk^at-i^ the Dirac brackets in (3.64) may be viewed as the 
canonical brackets defined in the restricted phase space iqk,ak-i,Pk,ak~i)- We 
observe that it is always possible, in principle, to express the multipliers u^ as 
functions of the coordinates and velocities {qk^^j,-,qk,a,,-i) if we solve the equations 
[Pon89, HT92] 



Qk,ak-l 



dH, 



dpk,ak-l 
R 



Qk,^k^Pk,ak-l\Qk,fk^ 1k,ak-i 



+ '^^'{Qk,-,k,Qk,ak-l] 



i=l 



dPk,cxk- 



,1k 



,Pfc,afe-l(gfc,7fc,gfc,afe-l)), (3.65) 



and provided that all the constraints, (pi ^ foi i = 1, ..., R, are independent.^ 
Moreover, the existence of these extra variables w* enables us to invert the Legendre 
transformation defined from (g, g)-space to the constraint surface in {q,p,uj)-space 
by means of the one-to-one correspondence 



'7/c,7fe — 9fc,7fe ) 



qk. 



Ik 



qk. 



7fc' 



dH, 



L0i(9fe,7fe)Pfc,Qfe-l) 



= 0. 



Consequently, equations (3.64) and the definition (3.51b) of the momenta pk^a^-i 
are equivalent. I 



Before considering the Hamilton-Dirac equations with respect to the conjugate 
momenta we can prove the following useful lemma [Pon89]. 



^This corresponds to the irreducible case that we assume for simplicity. The reducible case 
could be treated without much difficulty (see [HT92]). 



3.2. Theories with higher derivatives 75 



Lemma 3.2.1. Under the current assumptions the following identity holds: 

dHr sr- if ■ \ 9(pi dL 

j^ = -|:.fe,..,,„._Oa^-j^. (3.66) 

r rooi. From the above remarks on the invertible character of the Legendre trans- 
formation we may infer the identity 



Hr [ qk,-/k , Pk,ak-1 [qk^k ' ik,ak-l, 
K 
= 2j^fc'°fe-l(^'=.7fe'9fc,Qfe-l) 9fc,afe-l - L{qk,j^,qk,ak-l), 



k=l 










whence we obtain 










dHr 

dqkrtk 


1=1 ^ 




dpi,ai-l 

dqkrtk 


dL 

dqkau 



After the apphcation of equations (3.65), this becomes 

dH, ^-^^-^ i, . . d(f>i dpi^ai-i dL 

Since the identity 

</'i(%,7fe,Pfe,Qfe-i(gfc,7fc,4fc,Qfe-i)j =0 (3.68) 

obviously holds, equation (3.67) reduces to the expected result, namely 

9Hr \^ if ■ \ 9(pi dL 

7^ = -2^^ [1knk^'lk,ak-i)Tr jr- • (3.69) 

dQk,-/k f^l dqk,^^ dqk,^^ 



Consider now the second half of Hamilton's equations. 

Proposition 3.2.5. Let H he the Dirac Hamiltonian (3.61) associated with the 
singular system. The Hamilton-Dirac equations with respect to the conjugate 
momenta Pk,Pu f^^r f3k = 1, . . . , afc — 1 j are equivalent to the Ostrogradsky recursion 
relations (3.55). 

Prooi. We firstly write down the equations of motion with respect to the conjugate 
momenta Pk,i3k '■ 

R 
PkA = {PkA,H}^ = {pkA,H,}^ + "^uj'{pk,fs^,^i}^-pk,f3^^i. (3.70) 
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Owing to Lemma 3.2.1, this reduces to 

dL 



PkA 



dqkA 
which is equivalent to equation (3.55). 



Pk,i3k-i for /3fc = 1, . . . , ttfc - 1, 



(3.71) 



We are now at the right stage to prove the Ostrogradsky theorem for constrained 
systems [Pon89]. 

Generalised Ostrogradsky theorem. Let L be a singular Lagrangian depend- 
ing on generalised coordinates Xk and their time derivatives up to order a^ ^ 1 (for 
k = 1, . . . , K). Let L be the associated extended Lagrangian depending on the aux- 
iUary degrees of freedom g^^^^ = xjj'' (for 7^ = 0, . . . , a^ — Ij. The Euler-Lagrange 
equations derived from L are equivalent to the system of canonical equations ob- 
tained from the Dirac Hamiltonian 

R 

i=l 

rrooi. Owing to Lemma 3.2.1, Hamilton's equations for the momenta Pkfi, 

R 

]^-PkA-i, (3.72) 



Pk,0 = {Pk,0,H}^ = {Pk,0, Hr}^ + J2'^'{Pk,0,4>i}' 

reduce to 

Pk,0 



i=l 



dL 



dqk,o ' 



(3.73) 



which are identical to the Euler-Lagrange equations (3.38). The conclusion is then 
readily inferred owing to Proposition 3.2.1, Proposition 3.2.2, Proposition 3.2.3, 
Proposition 3.2.4, and Proposition 3.2.5. ■ 

The above results establish the equivalence between the original Lagrangian 
equations and Hamilton's equations derived from the Hamiltonian (3.61). The 
details of this equivalence are displayed inside the box below. 



qk,/3k-l = {Qk,l3k-l:H}^ 
Qk,ak-1 = {(lk,a^,^l,H}^ 
PkA = {PkA,H}ri^ 

Pk,o = {pkA,H}^ 



qk,l3k - ?fc,/3fe-l, 
[Lagrangian constraints] 

— dL 

Pk,ak-1 - dqk,c,,,-i' 

[Definition of momenta Pk,c«N-i ] 

dL _d_ 

Pk,l3k~l — dqk,i3^^ dtPf^Ai 

[Recursion relations for momenta Pk,/3|^- 
_d „ _ dL 
dtPkfi - dqkfi ■ 

[Euler-Lagrange equations] 
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The canonical equations thus include: Lagrangian constraints corresponding to 
the auxiliary degrees of freedom; Ostrogradsky's definition of momenta; and the 
Euler-Lagrange equations of motion. 

Remark. If we performed the Legendre transformation on the basis of the definition 
of momenta (3.37) — that is, without introducing an extended Lagrangian — , then 
new constraints would arise: Their explicit form would be obtained by requiring 
that the primary constraints (3.52c) be preserved in time [SSOK89]. 

If we write down the Hamilton-Dirac equations in the following nonnormal 
form, 



dqk. 



R 



Ik 



dt 



dt 



{^fe,7fe'^c}i, + X]^*(^fc.7fe'9fc,"fe-l){^fc,7fe''^i}D' (3.74a) 

R 
{Pk,y,, He} ^ + ^^' {qk,y^, qk,a,-l) {Pk^k^ M V' (3.74b) 



j=i 



then it is clear that we do not impose any constraint on them — constraints (3.52c) 
are hidden in equations (3.71). The ability to cast the above equations (3.74) into 
the standard form 

—^-{qk,^„Hc}^ + Y,uj'{t){qk,^^,ct)i}^, 

i=l 



dPfc,7fc 



R 



{Pfc,7fc , He} ^ + ^Uj\t){pk,j^, Cl)i} , 



where uj^(t) are arbitrary functions of time, entails the restriction of the trajec- 
tories in phase space to the surface that is generated by the constraints (3.52c). 
This requirement leads generally to the determination of some multipliers and to 
secondary constraints as well. 

The Hamiltonian formulation of higher-order theories obtained by unifying the 
Ostrogradsky method with the Dirac formalism for constrained systems is not sat- 
isfactory when having in prospect the development of a canonical formalism for 
gauge theories; the very reason lies in the fact that the usual formulation of gauge 
theories is given in terms of geometrical quantities like covariant derivatives or cur- 
vature tensors: In particular, the prescription (3.45) for introducing the auxiliary 
degrees of freedom does not take into account any of those features present in a 
gauge theory. Therefore, we present hereafter in Subsection 3.2.4 a generalisation 
of the constrained Ostrogradsky construction that will allow us to bring forth a 
consistent Hamiltonian formulation of higher-order theories of gravity. 
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Remark on an alternative formalism. A coherent way of building up a Hamilto- 
nian formalism for theories with higher derivatives is provided by the constrained 
Ostrogradsky method, as explained above. Curiously enough, some authors de- 
vised an alternative formalism, in the specific case of second-order gravitational 
Lagrangians, the peculiarity of which is to preclude the occurrence of constraints 
at the Lagrangian level [Sch97, Kas97]. The trick is the following: add to the orig- 
inal second-order Lagrangian Lq{x,x,x) a total time derivative of a second-order 
arbitrary function W{x,x,x); replace straight into the action functional — that is 
without Lagrange multipliers — the variables x by new independent degrees of free- 
dom q; then fix the explicit form of the function W by requiring that variation 
of the action with respect to q yield precisely the relation x = q. This skirting 
procedure, which takes advantage of the freedom of adding a total derivative to 
the Lagrangian without altering the equations of motion, is misleading: An am- 
biguity arises due to the presence of terms involving dx/dt. For this reason, the 
would-be advantage of the recipe — the mere absence of constraints in the varia- 
tional principle — is on the contrary a serious drawback in comparison with the 
constrained Ostrogradsky approach. Even worst, this contrived formalism comes 
to naught in the case of singular second-order Lagrangians: The treatment of the 
primary constraints is not compatible with the choice x = q (see the comment on 
page 82). 



3.2.4 Generalised constrained Ostrogradsky construction 

The basic idea of the generaUsed Ostrogradsky method for constrained higher-order 
systems is to allow for a more general definition of the auxiliary Ostrogradsky 
variables q^^, instead of the standard definition (3.46). At first sight the general 
formalism that is given in the literature provides a satisfactory treatment [BL87, 
BOS92, GT90]. However, as explained below, it is tainted with a small technical 
mistake that renders the method ineffective in general; this illustrates how pitfalls 
may arise when one is building up such an abstract theoretical setting without 
testing it on simple examples or toy-models. Nevertheless this general formalism 
can be suitably adapted, taking into account the specific features of the system 
under investigation; we shall come back to this point later (cf. Subsection 3.3.2 
on f{R) theories of gravity); but, in the meantime, we analyse the generalised 
Ostrogradsky method. 

Consider a system with K degrees of freedom Xk ioi k = 1, . . . ,K and assume 
that the associated Lagrangian (with higher derivatives) be given by expression 
(3.35). Let us introduce new independent variables qk,'yk for 7^ = 0, . . . , (a^ — 1) 
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in accordance with the prescriptions 



9feA = Qk,i3k[xi,xi,...,xl 
<?fc,0 = Qk,0 = Xk, 



(dkl) 



9ki = min(/3fe,a/ - l), 



(3.75a) 
(3.75b) 



where Qk,Pk ^'^^ arbitrary functions that may depend on generahsed coordinates 
xi and their time derivatives up to order Oki- We require that relations (3.75a) be 
invertible in terms of the highest-order time derivatives occurring in the exphcit 
form of the functions Qk.Pu ! ^^i^ requisite entails the conditions 



A, 



detf% 



/O fora = l,...,Af-l, 



(3.76) 



where the matrices associated with these nonzero determinants A^- contain only 
those elements for which the determinant index a satisfies a < N {= sup^-jafc}) 
(see Example 3.2.2). 

In contrast with the standard definition (3.45) of the auxiliary degrees of free- 
dom, definition (3.75) is more general and it enables one to choose specific forms 
for the functions Qk,Pk according to the actual characteristics of the system under 
study. Formally, different choices of functions Qk.pu ^^^^ ^^^^ ^^ distinct Hamilto- 
nian formulations; in that respect, we will address in the sequel the question of how 
these various formulations are connected with one another (see Proposition 3.2.8). 

We consider now a specific example in order to clarify the practical use of 
prescriptions (3.75) and (3.76). 

Example 3.2.2. Let L = L(x,i;,x,x^^'*,y,y,i/, y'-^^y^^^). We may infer, from the 
highest-order time derivatives corresponding to x and y occurring in L, the values 
of ctfc, namely a^; = 3 and ay = A respectively. We introduce new coordinates 
according to the rules (3.75), that is 



Qxfi = X 

qx,i = Qx,i{x,x,y,y) 

qx,2 = Qx,2{x,x,y,y,y) 



qyfi = y 

Qy,i = Qy,i{x,x,y,y) 

qy,2 = Qy,2{x,x,y,y,y) 

qy,3 = Qy,3{x,x,y,y,y,y'-^^). 



We then construct the determinants Aq-, fulfilling conditions (3.76): 



Ai 



dQa:,i/dx dQx,i/dy 
dQy^i/dx dQy^i/dy 



Ao 



dQx,2/dx dQx^2/dy 
dQy,2/dx dQy^ijdy 



A, 



dQ 



yfi 



9t/(3) 
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Owing to conditions (3.76) we may solve equations (3.75a) in terms of time 
derivatives of order (3^ (for /3fc = 1, . . . , a^ — 1) of the generalised coordinates; the 
resulting functions are 






Henceforth we slightly depart from what is found in the literature (see, e.g., [BL87]), 
wherein an erroneous result renders the subsequent analysis ineffective in general. 
Nevertheless let us firstly demonstrate the mistake. The authors claim that the 
first-order time derivatives qk,i3k-i could be expressed in terms of the auxiliary 
degrees of freedom g;^p; (for pi = 0,...,6ki) only. However, in general this is 
not true, for differentiating with respect to time the quantities x^'' for a^ = 
1, . . . , (ttfc — 2), which are given in equation (3.77), we obtain the series expansion 

^ ""''' dX"'' 
K'^' = J2T.7r^'i^'Pi' 7H = min(afc,a,-l), (3.78) 

1=1 pi=0 ^^''« 

which contains explicitly first-derivative terms of the form ^/,a,-i if it ever happens 
that cjfc ^ {ai — 1). Unfortunately this last inequality corresponds to the generic 
case, as it can more easily be checked on Example 3.2.2. The only means to 
remove these first-derivative terms is to assume that each coordinate Xk occurring 
in the Lagrange function L have one and the same maximal order N (i.e. a^ = 
N V/c). This requirement is not a severe restriction though for we could always 
add a total derivative term to the original Lagrangian (3.35) without modifying 
the Lagrangian dynamics and such that the above assumption would be satisfied. 
However, we must ensure that distinct Hamiltonian formulations stemming from 
Lagrangians that differ by a total derivative only be equivalent; this is achieved 
with the following result.^ 

Proposition 3.2.6. Consider two Lagrangians, L (of order N) and Lq (of order 
N — 1) that differ by a total time derivative, viz. 

L(x,x,...,x(^))=Lo(x,x,...,x(^-i)) + ^W^(x,x,...,x(^-i)). (3.79) 

The two distinct Hamiltonian formulations that are constructed on L and Lq re- 
spectively are canonically equivalent; the appropriate canonical transformation is 
defined by 

9/3-1 = gf 1, (3.80a) 



'^For our purpose it is sufficient to consider only one degree of freedom. 



3.2. Theories with higher derivatives 



JO) 



dW 



V(5-i=p'pU + -j^, (3.80b) 

where (g^_i,p^_i) and {qa_i-,Pa_i) (for P = I, . . . , N — 1) denote the canonical 
variables associated with the Lagrangians L and Lq respectively. 

Prooi. We firstly examine the Ostrogradsky formalism for the Lagrangian Lq. 
We introduce auxiliary degrees of freedom and their conjugate momenta through 
the standard recursion relations 

(0) (0) dLo 

' yN-2 ~ „ .(0) ' 



d<i'N'^ 



2 



o(0) - 0^°^ 0^°) - ^^ - — o(°) for CT - 1 iV - 2 

'da — la-l' Pa-1 — (Q) ^^ "o" lOr Cr — i, . . . , JV Z. 



We then obtain the canonical Hamiltonian corresponding to Lq, namely 

N-2 

(n\ (0) . .(0) (Q) ^ f rm ,., 

I ^N-2 , 



rr(0)/„(0) JO) \ V- (0) JO) .(0) (0) tJM M 

J^c [qp-vPis-ij - /^ qg'Pa-i + Qn~2Pn-2 - ^OyQfS-yf^ 



a=l 

Now we turn to the Ostrogradsky formalism for the Lagrangian L. 

Once again we introduce auxiliary degrees of freedom and their conjugate momenta 

through the standard recursion relations 

dW 

90 = X, PN-l = -^ , 

dL d 

qp = qp-i, pp-i = — p/3 for /? = 1, . . . , Af - 1. 

We then obtain the canonical Hamiltonian corresponding to L, namely 

N-l . „^ X 

Hc{q-y,P-y) = "^{p^-i - -Q J9/3--^o(i?7) for 7 = 0, . . . , A^ - 1. 

Note that the restricted Hamiltonian (3.57) does not depend here on the momen- 
tum pn-1] hence the pair of canonical variables [qN-i^PN-i) are associated with 
spurious physical degrees of freedom. It is then straightforward to see that the 
canonical transformation (3.80) turns the Hamiltonian H^ into the Hamiltonian 
Hi'\ U 

For completeness we write down the explicit form of the nontrivial primary 
constraints of the A''*'^-order theory, 

dW 

(pN-i = PN-l - ^ ~ 0, (3.81a) 
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(j)N-2 = PN-2 - ^ 7^ ~ 0. (3.81b) 

dqN-i oqN^2 

The former constraint is first class and does not generate any secondary constraint; 
it corresponds to the gauge freedom associated with the choice of the function W, 
especially with respect to its dependence on the variable qn-i- The last constraint 
arises from the singular character of the original Lagrangian Lq; its preservation 
in time generates secondary constraints which are classified as usual into first- 
class and second-class constraints. Furthermore it can be easily shown that the 
Dirac bracket structure is preserved under the canonical transformation (3.80) and 
that both Hamiltonian formulations lead to equivalent quantum counterparts via 
canonical [Kam96] or path-integral quantisation methods [Gro93]. 

Further remark on an alternative formalism. Proposition 3.2.6 sheds a new light 
upon the alternative formalism briefly discussed on page 78. Kasper shows that this 
formalism is canonically equivalent to the Ostrogradsky construction performed on 
the original second-order Lagrangian Lq [Kas97]. This relationship is even more 
manifest when one applies the Ostrogradsky method to the third-order Lagrangian 
involving the total time derivative of a specific second-order function, the explicit 
form of which is chosen in accordance with the aforementioned recipe. Indeed, the 
relation (3.80) enables us to write down the corresponding Ostrogradsky momenta, 
Po and pi, in terms of the original momenta, Pq and p\ , and compare them with 
the 'alternative' momenta, ttq and vri. We find the relations 

(0) ^ dW 
dqo 
p^_p(o) I dW _dLo ^ dW ^^^ 
^ g(j(°) dx dx 

as well as the definition of momentum p2 , 

dW 

P2 = -TTTT = vri. 

ox 

The vanishing of the momentum pi is a direct consequence of the hypothesis that 
the original Lagrangian be regular and of the condition that has to be fulfilled by 
the function W to render the alternative formalism meaningful. In other words, the 
alternative formalism has been defined to preclude the occurrence of the primary 
constraints (3.81). If the standard constraint analysis is performed within the 
Ostrogradsky formalism, then the above relations for pi and p2 are second-class 
constraints. 
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In agreement with the previous discussion we thus assume that the maximal 
order of variables Xk is one and the same, that is a^ = A^ Vfc. The auxiliary variables 
are introduced with the definition (3.75) — with 9ki = Pk — such that conditions 
(3.76) be fulfilled. Thus, differentiating equations (3.77) with respect to time we 
can express the time derivative of variables qk^p^-i as 

Qk,t3k-i = QkA (9«,o, • • • , Qi,t3k)^ (3-82) 

where the functions Qk,i3k can be determined with the recursion relations 

Xl = Qk,i, (3.83a) 



dX° 
^r^' = E E 7^Q'.«+i forcT. = l,...,iV-2. (3.83b) 



d(ii,pi 



We introduce velocities Vk^N-i with the definition 

Vk,N-i ■■= qk,N-i- (3.84) 

Remark. At first sight, it could seem unnecessary to add more degrees of freedom 
to the theory: Indeed, we did not need to introduce such velocities through the 
constrained Ostrogradsky construction (cf. Subsection 3.2.3). In the literature 
the consideration of these new variables is merely a loophole: When one adopts 
definition (3.84) the equations of motion for the variables qk,N-i simply become 
Lagrangian constraints; the equivalence with the definition of momenta pk^N-i is 
lost and primary constraints (pi ^ do not occur at this stage. So to speak, one 
eludes the discussion given in the proof of Proposition 3.2.4, that is the discussion 
on the invertibility of the Legendre transformation associated with the presence of 
primary constraints 4>i ~ 0. The resulting formalisms are hybrid-like: At one time 
velocities are regarded as independent degrees of freedom; at another time they are 
viewed as the first-time derivatives of variables qk^N-i- This is confusing and this 
could lead to mistaken interpretations (see [BL87, BOS92, GT90]). For that very 
reason we give a more throughout analysis, hereafter. 

Assuming that we adopt the definition (3.84) for the velocities Vk,N-i we can 
prove the following result [BL87]. 

Proposition 3.2.7. Under the current assumptions the functional form of the 
highest-order time derivatives of the generalised coordinates Xk is given by 



^k 



( dX \ 
X^{qi,o,...,qi,N-i,vi,N-i), det ^ / 0. (3.85) 
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Prooi. Taking into account equations (3.75), (3.77), and (3.82) we obtain succes- 
sively 






1=1 Pi=0 



1=1 pi=0 ^'■'P^ 1=1 ^''" ^ 

where the last right-hand side may be written as a function X^ , which could be 
formally given by equation (3.85). ■ 

We now interpret equations (3.82) and (3.84) as Lagrangian constraints and we 
replace the original (higher-order) Lagrange function L by the extended (first- 
order) Lagrangian L, 

K N-1 
L{q, q, A, V, fj) = L{q, v)+^Y^ (%A-l - QkA)XkA 

fc=iA=i 
K 

+ ^{Qk,N-l - Vk,N-l)fJ'k,N-l, (3.86) 

fc=l 

with Lagrange multipliers Xk,i3k ^^^ l^k,N-i, so as to recover the interpretation in 
terms of the original set of coordinates. In close analogy with Proposition 3.2.1 the 
Euler-Lagrange equations derived from the extended Lagrangian L are equivalent 
to the Euler-Lagrange equations obtained from the original Lagrangian L. (The 
proof sheds no new light upon the subsequent analysis.) 

The momenta canonically conjugate to the independent degrees of freedom are 
defined respectively by 

dL 

PkA^i ■- ^T XkA, (3.87a) 

dL 
Pk,N-i ■= ^-^^ — = Pk,N-i, (3.87b) 

CQk,N-l 



(p) ._ dL 



■ 0, (3.87d) 

0. (3.87e) 



dfik,N-i 
(v) ._ dL 



dvk. 



N-l 



We thus have five sets of Lagrangian constraints that we denote as Lpk,p^-i ~ 0, 
'^k,N-i ~ 0, TT^ j^ K. 0, vr[.^jv-i ^ 0' ^"d '^k,N-i ^ respectively. 
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The canonical Hamiltonian of the system is readily obtained, viz. 

K N-1 K 

Hc{q,p,v) ='^'^ pk,f3f^-iQk,i3k +'^Pk,N-iVk,N-i - L{q,v), (3.88) 

fc=l/3fc=l fc=l 

where the constraints in the extended Lagrangian (3.86) have been used. 

Before resorting to the Dirac analysis of the system with canonical Hamiltonian 
(3.88) we must firstly identify the primary constraints that characterise the singu- 
lar theory under study. In contrast with the standard treatment — the formalism 
without the extra velocities Vk^N-i — , these primary constraints do not occur in 
definition (3.87b) of the momenta Pk,N-i- Instead, they occur in the canonical 
equations of motion for the momenta vr'^-* , derived from the canonical Hamiltonian 
(3.88), viz. 



^(^) _ dHc _ dL 

dVk,N-l dVkN-l 



Kn-i = -7^-^ = -. P^^N-, « 0, (3.89) 



where we have enforced the preservation in time of the constraint (3.87e). In 
the aforementioned hybrid formalism no rigorous constraint analysis is performed 
and equation (3.89) is used to define the primary constraints (t)i{q,p) ~ 0. How- 
ever, if the velocities are considered as independent variables — not merely a handy 
notation — , then the above interpretation of equation (3.89) is misleading. In other 
words, to ensure consistency of the formalism one must either refrain oneself from 
introducing velocities v as additional — though spurious — degrees of freedom or one 
has to perform the standard constraint analysis, assuming that these velocities are 
independent — though constrained — degrees of freedom. Choosing the first option 
we are brought back to the analysis made in Subsection 3.2.3; adopting the second 
we must consider the Dirac Hamiltonian to be given by 

K N-l 

H^--=H. + EE {^''''-' ^^.^.-1 + ?af 41, 
fc=iA=i 

K 
I Sr^f k,N~l , (-k,N-l (p) , ^k,N-l (v) \ /o nn\ 

+ 2^[V' ^k,N-i + e(^) <^_i + C(;) n'N-i) , (3.90) 

k=l 

with Lagrange multipliers i] and ^ associated with the primary constraints. 
Enforcing preservation in time of the primary constraints we obtain 

^.,..-1-0 - <f)' = J^ EEY^^)^^./^^' (3.91a) 

W dqk,N-i ^^^-^\dqk,N-iJ 
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and 

41 - -^ ^'='^'=-1 = 0, (3.91c) 

4^-1 « — 7?^'^-! = 0, (3.91d) 

4!Li « -^ xl:Li := 7^- Pk,N-i « 0. (3.91e) 

Some multipliers are thus determined through equations (3.91a)-(3.91d) while the 
secondary constraints XkN-i ^^ arise from equation (3.91e). Note that all con- 
straints but those associated with the velocities are second class; we can thus 
coherently remove them from the formalism provided that we define the appropri- 
ate Dirac bracket. On the other hand, time-evolution of the secondary constraints 
XkN-i yields the equation 



1=1 






which enables to fix K — r multipliers ^(„) only, on account of the fact that the 
rank of the Hessian matrix, as given by equation (3.50), is expressed in terms of 
the velocities Vk,N-i by 

rank ^^ = K - r. 3.93 

yOVk,N-lOV^N-l J 

At this stage one may work out the standard consistency algorithm of the 
Dirac method, ending up eventually with a complete set of primary and secondary 
constraints, which can be classified into first-class and second-class constraints 
depending on the specific features of the theory. 

Remark. In contradistinction to the hybrid approach [BL87] equation (3.91e) de- 
fines constraints, be the system regular or not. 

This is now the right stage to address the aforementioned issue on the possi- 
ble connection between distinct Hamiltonian formulations stemming from different 
choices of the arbitrary functions Qk,i3k ', the following proposition provides us with 
the appropriate answer [GT90]. 

Proposition 3.2.8. The Hamiltonian fornaulations developed frona one and the 
same Lagrangian L by different means of introducing the auxiliary degrees of free- 
dom in the generalised Ostrogradsky method are canonically equivalent. 
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Prooi. It is sufficient to prove that an arbitrary choice of the functions Qfc,^ leads 

to a Hamiltonian formulation that is canonically equivalent to the constrained 

Ostrogradsky Hamiltonian formulation of Subsection 3.2.3. 

We consider the standard Ostrogradsky variables iqk,'rkiPk,'rk) defined as previously 

by 

Wk) 

Qkfl — Xk, qk,l3k — ^k ' 

dL dL 

Pk,N-i - jr. > Pfc,/3fc-i = -w- PKPk- 

oqk,N-i (yQk,i3k 

We introduce the generalised Ostrogradsky conjugate pairs {Qkrtk^ -^k,'yk) with the 
following prescriptions: Firstly, the auxiliary degrees of freedom are defined by 



Xk — Qk,0-, 

^k '' =Xk''iQi,o:---,QiA); 

then, their conjugate momenta are determined through the recursion relations 

dL 



Pi 



k,N-l 



dQk,N- 



K N-l r,^ p.j 

which generalise relations (3.55). (Note that we do not introduce the velocities 
Vk^N-i to keep the comparison between both formalisms more transparent.) 
We try to find the generating function of the canonical transformation that maps 
variables {Q,P) onto variables {q,p) [SM74]. On account of the above definitions 
of variables qk,'yk the appropriate choice consists in taking the type-2 generating 
function F2[Q,p) that is defined by 

K K N-l 

F2{Q,p)=Y, Pkfi Qk,o + J2Y1 P^A ^t (Q) ■ 
fc=i k=iPk=i 

Indeed, from that specific form we obtain the relationship between variables qk,'y^ 
and (5fc,7fe, viz. 

qkfl — 7^ — Vfe.O) 

OPkfl 

qkA = ^ = ^k [Q) ■ 

dpkA 
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On the other hand, the conjugate momenta are transformed as 

dF K N-l Q^di 

Now we must check that the canonical transformation generated by F2[Q,p) be 
preserving the canonical Hamiltonian, viz. 



H,{q,p)^H,{Q,P) 
gradsky method, namely 



We start with the canonical Hamiltonian (3.88) obtained in the generalised Ostro- 



K K N-2 

H,{Q, P)=Y1 Pk,o Qfc,i + Y.Y1 P^'-^ 2'^.-fc+i 

fc=l fc=l(Tj;=l 

K 

+ 2_^ Pk,N-l Qk,N~l — L[Q, Q) . 
k=l 

Utilising relation (3.94a) we expand the first term of He, which is 

K K K N-l Q^di 

k=l k=l k,l=lf3i=l ^^'^ 

Making use of relation (3.94b) we find for the second term of i^c the series expansion 

K N-2 K N-l N~2 r.yPi 

Y Y Pk,ak Qfc,<7fe + 1 = X] X] X] QO ' ^'''^' '^k,ak + l- 



Repeating once again this procedure for the third term of H^ we obtain 

d: 

dQk,N-i 



K _ K N-l Q^pi 

k=l k,l=lt3i=l 



In agreement with the proof of Proposition 3.2.7 we derive 

K N-2 r,yN-l K p.yN-1 

1=1 p,=0 ^'■'P^ 1=1 ^''^^ ^ 
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which enables us to rewrite the third term of He as 

K K K N-2 Q^N-l 

2_^Pk,N~lQk,N~l = 2_^Pk,N-iqk,N~l — 2_^ A^ "^ Pk,N-l Ql,pi+1- 

k=l fc=l k,l=lpi=0 ^^'P^ 

Summing up all these intermediate results we obtain for the canonical Hamiltonian 
He the expression 

K K 

Hc{Q,P) = '^PkflQk,! +'^Pk,N-iqk,N^i - L{q,q) 

k=l fc=l 

K N~l N~2 Q^iSi K N-2 q^N-1 

+ I^ IZ I^ an! ^''^' Qfc'pfc+i - J^ J^ -an, — pkn-iQi,pi+i- 



On account of equations (3.83) it is not difficult to show that the last two terms of 
the above expression simplify to 

K N-2 

Hence the canonical Hamiltonian H^ reduces to 

K N-l K 

Hc{q,p) =^^ VkA-iqPk +^PKN-iqk,N-i - L{q,q), 

A;=l/3fe = l k=l 

which coincides with the canonical Hamiltonian (3.59) of the standard Ostrograd- 
sky construction. ■ 

3.3 Higher-order theories of gravity 

3.3.1 A short summary of canonical general relativity 

Tnree-plus-one splitting oi space-time 

Because general relativity is an already parameterised theory it seems natural to 
'de-parameterise' it so as to make the constraints manifest and proceed to its canon- 
ical formulation (cf. the discussion on page 59). The celebrated path to achieve this 
programme consists in foliating space-time into three-dimensional spacelike hyper- 
surfaces parameterised by a global time function.^ There are, so to speak, two 

^There are numerous reviews on the 3 + 1-splitting of space-time, which was firstly used in 
the development of the adm formalism; one can pick out, e.g., [ADM62, BG70, MTW73, Mac75, 
Kuc81, Que92]. 
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opposite ways of looking at the reliability of the 3 + 1-splitting. The first, which 
is the most convenient to adopt, advocates its use for any field theory because it 
yields naturally the determination of the degrees of freedom, the constraints, the 
gauge freedom, and the field evolution equations [IN80]; in general relativity, in 
particular, it provides a good insight into the nature of constraints and it simplifies 
the action principle and the initial- value problem [BG70]. The second point of view 
stems from the idea that the 3 + 1-splitting seems to be contrary to the whole spirit 
of general relativity. Furthermore this splitting restricts the topology of space-time 
to be the product of the real line with some three-dimensional manifold. In regard 
to quantum gravity this inhibition is not welcomed for one would like to allow all 
possible topologies of space-time [Haw79]. Henceforth we adopt the first, quite 
conservative, point of view. 

The 3 -|- 1-splitting of space-time relies on the following important result (see 
[Wal84, Chapter 8]). 

Theorem 3.3.1. Let M be a time-orientable globally hyperbolic space-time en- 
dowed with a Lorentzian metric gab of signature ( — 1--|— 1-). Then [M^gab] is stably 
causal. Furthermore a global time function t can be chosen such that each surface 
of constant t is a Cauchy surface. Thus M can be foliated by Cauchy surfaces S^ 
and the topology of M is M x S, where S denotes any Cauchy surface. 

ADM variables. The basic geometric data of this decomposition are: 

(I) the intrinsic geometry of the hypersurfaces St, described by the induced 
three-dimensional Riemannian metric hab on each S, which is defined by the 
formula 

hab = gab + naUb, (3.95) 

where n" denotes the unit normal vector field to the hypersurfaces T,t (note 
that ha plays the role of a projection operator from the tangent space to M 
onto the tangent space to E; ha : TpM i-^ TpS); 

(II) the way one goes from one hypersurface to the other, determined by the lapse 
function N and the shift vector N'^ which enable to decompose a 'time-flow' 
vector field t" on M satisfying f^Vat = 1 into its parts normal and tangential 
to S, viz. 

N = -eua = [n'^Vat] , (3.96a) 



Na = habf' (3.96b) 
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{N measures the rate of flow of proper time with respect to coordinate time 
as one moves normally to S, whereas N'^ measures the 'shift' tangential to S 
contained in the vector field t"'); 

(III) the way each S is embedded in (M,(7a6), provided by the extrinsic curvature 
tensor Kab which is defined by 

Kab = -h^hffV(^^nd) = -hf^'^VcUa = --Cnhab, (3.97) 

where Cn denotes the Lie derivative along the normal to S (one may think 
of Kab as a generalised notion of time derivative that describes the 'bending' 
of E in space-time). 

Prescriptions (l-IIl) above imply that, in terms of N, N"-, and t"", we have n" = 
(i" — N"-)/N and hence the inverse space-time metric can be written as 



9 



ab r^ab 



Ifh _^-2(^^a_^a-^(^^b _^by (3gg^ 



This suggests to choose, as the field variables, the spatial metric hab, the lapse 
function A^, and the covariant shift vector Na rather than the inverse metric g"'^ , 
which is usually used in the variational principle. This equivalent set of variables 
is usually referred to as ADM variables. Furthermore the three-metric hab uniquely 
determines the derivative operator on S compatible with hab, which we denote as 
Da. Regarding tensor fields on S as fields on M with all their indices orthogonal 
to n" we can re-express the action of D^ in terms of that of Vq, i.e. the derivative 
operator associated with Qab, viz. 

DcT"-\...,^ = h'^\ . . . \;^hJVfT''^-\,,,,^. (3.99) 

The derivative operator Dq brings forth a curvature tensor ^^'R%f.fi on St. It is easy 
to show that, in terms of D^, the extrinsic curvature tensor in (3.97) takes the form 

Kab = ^{-Cthab+BaNb + BbNa). (3.100) 

With respect to a coordinate basis the normal to S^ has the components n" = 
(1, —N''-)/N and n^ = {—N, 0) respectively and the metric gab can be cast into the 
form 

ds^ = hij {dx' + N'dt) {dx^ + N^dt) - N^dt (g, dt, (3.101) 

which enables one to identify its components, Qap, as well as those corresponding 
to its inverse, g"'^ , viz. 
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We thus see that the covariant components gij and hij coincide, whereas the con- 
travariant components g"^^ and /i*-' do not. Moreover, with regard to volume ele- 
ments, we obtain \/~9 = Nyh by application of the Frobenius-Schur formula. In 
terms of N , Ni, and hij, the components of the extrinsic curvature tensor defined 
in (3.97) and (3.100) are given by 

K^, = ^{-^+N^,+N,\,), (3.103) 

where a vertical stroke denotes covariant differentiation on S^ (a semi-colon denotes 
covariant differentiation in space-time). 



The Gauss, Codazzi, and York equations. The space-time curvature tensor R%^^ 
is connected with the curvature tensors on S, i.e. ^^'R%cd ^^^ ^ab, through the 
Gauss equation 

^')R\,, = h\h/h/hfR"}g, + 2K'^^,K,^,. (3.104) 

Suitable contractions of this equation with the three-metric hab yield 

(%,6 = K'htfR^d + n'nfK"h,''Recfd + 2K\,K,]a- (3.105) 

In a similar way we derive the Codazzi equation 

K^hfhg^naR\,d = 2D[/Ke]9, (3.106) 

and its contraction, 

hS^Rab = BcK - BaK^c- (3.107) 

In addition to the Gauss equation (3.104) and the Codazzi equation (3.106) we 
derive, after tedious calculations, the York equation^ 

n'^n'^hJ'h/Rabcd = CnKef + D(ea/) + KegK^^ + a^af, (3.108) 

where Oc stands for the four-acceleration of an observer moving along the normal 
to S, viz. ttc := n^VhUc- Note that in the synchronous Gauss system {N = 1 , 
A^* = 0) equation (3.108) reduces to the most commonly found expression 



dKi^ 
dt 



RoiOj = ^ + K,kK), (3.109) 



^It is actually an exercise proposed by York in [MTW73]; a detailed proof can be found in 
[Que92]. 
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on account of the following formula expressing the Lie derivative of Kij 

Contraction of the York equation (3.108) readily yields 

n^vf'Rab = h^^CnKab + /j"''D(,a,) + K^^i^"^ + a^a'. (3.111) 

Making use of the contracted Gauss equation (3.105) and the contracted York 
equation (3.111) we obtain a general expression for the scalar curvature, viz. 

R = (3)i? + K^ - SKabK'"'' - 2h^^CnKab - 2/i"^D(„a5) - 20^0^ (3.112) 

From equations (3.111) and (3.112) we obtain 

2n''n^Gab = ^^^R + K^ - KabK^K (3.113) 

Hence setting the left-hand side in equations (3.113) and (3.107) to zero we find 
the initial- value constraint equations of general relativity in vacuum. 

For completeness we may write the Codazzi equation (3.106) and the York 
equation (3.108) in terms of the Weyl tensor, viz. 



h„ ht h„ UaC ij^ji — 2 



tg "./ ng Ua^ bed 



heh/hg' - -h'^'iKghf - hfghf)\B[,Ka]b, (3.114a) 

X {CnKac + ^^'^Rac + KKac + D(„a,) + OaOc) • (3.114b) 

We can specialise the Gauss, Codazzi, and York equations (3.104), (3.106), and 
(3.108), which are pure tensorial expressions, to the ADM basis, which is defined 
by {e„ = {dt — N'^di)/N, ei = di}. We obtain respectively 

i?',,fc = (%',,fc + K,kK'j - K,jK\, (3.115a) 

-R\fe = Kij\k - Kik\j, (3.115b) 

Rninj = CnKij + ^ + K,kK). (3.115c) 

The set of equations (3.115) does provide all the information that is necessary to 
express the components of any four- dimensional curvature tensor in terms of the 
intrinsic and extrinsic three-dimensional tensors and of the lapse function and shift 
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vector. For instance, the components of the Einstein tensor are given by 

Gnk = K^k - K\^^, 

Gki = ^""^Gki + KKm - 2Kk,KS - hki{K^ - SKijK'^) 

On the other hand, equation (3.112) for the scalar curvature reduces to 

R = (% + i^2 - 3KijK'^ - 2N-'^h'^Niij - 20CnKij. (3.116) 

Adm gravitational Hamiltonian — ouriace terms 

In most field theories the Hamiltonian can be derived from the covariant action 
in a systematic way. In general relativity the situation is more intricate due to 
the presence of a surface term in the Einstein-Hilbert action. This difficulty 
was not addressed by Arnowitt, Deser, and Misner (adm) who conducted the 
first major investigation into general relativity — conceived as a dynamical system 
[AD59, ADM59, ADM60, ADM62]. Although they highlighted the importance of 
the constraints and showed that the Hamiltonian is precisely the spatial integral 
of the constraints, they inadvertently discarded a total derivative term inherently 
present in the gravitational action. By contrast, DeWitt was the first to realise 
that the identification of the Hamiltonian with the constraints is only valid in 
the case when the three-manifold is compact, without boundaries [DeW67]; in the 
case where the three-manifold is open, boundary conditions become crucial. Con- 
sider for instance the asymptotically flat situation: DeWitt put the total derivative 
term — which becomes a surface term at spatial infinity — back into the Hamiltonian 
and recovered the fact that the value of the Hamiltonian at a solution coincides 
with the ADM energy. Further investigation by Regge and Teitelboim demon- 
strated that the 'correct' boundary conditions, together with the requirement that 
the Hamiltonian and its variations be well defined, lead to the existence of ten 
surface integrals — the Poincare charges — in the Hamiltonian of the theory, which 
are connected with the Poincare group of transformations acting at spatial infin- 
ity [RT74, HRT76]. Beig and 6 Murchadha further re-examined this formulation 
in the language of symplectic geometry (they also corrected a mistake in Regge- 
Teitelboim's work having to do with the generator of boosts) [B6M87]. 

On the other hand. Gibbons and Hawking have advocated the addition ad hoc 
to the Einstein-Hilbert action of the surface integral 2§KdTi, the explicit form of 
which was firstly determined by York in a different context [Yor72], where K is the 
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trace of the extrinsic curvature of the boundary [GH77]. There are two separate 
arguments requiring that this boundary term be added to the Einstein-Hilbert 
action. In the first argument, one demands that the solutions of the classical 
field equations be extrema of the action under all perturbations that vanish on 
the boundary, i.e. 6hab = on 9M. This means that, on the boundary, only the 
normal derivatives of the induced metric are allowed to vary. In order to satisfy 
this condition one has to add a compensating term to the action the virtue of 
which is to cancel the second-order derivatives of the metric present in the scalar 
curvature. This compensating term is precisely the surface integral of the trace of 
the extrinsic curvature. The second argument, which entails the same boundary 
term, is formulated in the context of the path-integral formulation of the theory 
[Haw79, Haw93]: The gravitational action must be such that the amplitude to 
go from an initial to a final hypersurface must be independent of any arbitrary 
intermediate hypersurfaces and their associated induced three-metrics. 

Hawking and Horowitz have recently re-examined this long-standing discussion 
on surface integrals for manifolds with boundaries [HH96a].^° Starting from the 
Einstein-Hilbert action they derive the gravitational Hamiltonian without discard- 
ing any surface term — in contrast with the prevalent procedure. In particular, they 
show that the boundary terms in the Hamiltonian come directly from the bound- 
ary terms in the action and do not need to be added 'by hand'. They generalise 
the definition of the ADM energy for space-times that are not asymptotically flat 
though asymptotically approaching a static background solution. (They also dis- 
cuss the effect of horizons and the relation between their area and the total entropy 
of black holes.) We follow their approach. 

We start with the gravitational action in vacuum, 

S[9] = -^ f V^R-^f dJ:K, (3.117) 

where K is the trace of the extrinsic curvature of the boundary and dE is the 
surface element. (We assume that the boundary 5M consists of an initial and final 
surface with unit normal n", and a surface near infinity S°°, with unit normal r", 
on which n" is tangent.) As indicated above, the surface term in the action (3.117) 
is required so that the variational principle yield the correct equations of motion 
subject only to the condition that the induced three-metric on the boundary be 
held fixed — see, e.g., [Wal84, Appendix E], the specific works of Charap and Nelson 
[CN83], and York's [Yor88] (and references therein). Note that the action (3.117) 
is well defined for spatially compact geometries only — e.g., closed cosmological 



^°See also Hawking and Hunter's generalisation in the presence of nonorthogonal boundaries 
[HH96b]. 
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models. ^^ 

Pure divergences are hidden in equation (3.112): We derive an expression of 
the scalar curvature where they appear explicitly. We firstly have 



R — 2[Gab — Rab) 



rfn\ 



(3.118) 



The first term in equation (3.118) is given by equation (3.113) whereas the second 
term can be evaluated from the definition of the Riemann tensor given on page 6 
with «" = n", viz. 



n^n^Rab = K^ 



rab 



KabK'"' - Vain'^Vbu'') + Vb{n^Van^). 
Hence the scalar curvature is given by the general expression 

R = (% -K^ + KabK""" + 2\Va{n''Vbn^) - Vb{n''Van'') 



(3.119) 



(3.120) 



which is equivalent to equation (3.112). Observe that in the adm basis the volume 
integral of the four-divergence V c{Krf) contributes the term dn{VhK) to the 
action whereas in a coordinate basis it yields dt{\hK) — yh {KN^)\]^; in the latter 
case, adding the four-divergence '^cO^^ we recover the expression first given by 
DeWitt [DeW67], namely -2dt{VhK) + 2y/h{KN^ - N\^)\k. 

When substituted into the action (3.117) the two total derivative terms in 
equation (3.120) give rise to boundary contributions according to the formula 



d^Xy 



-g^aA'^ 



M 



d xdn 



M 



-gA- 



daaA\ 



aM 



(3.121) 



where the surface element dca is defined by dua = Va dS 



is the unit normal to the boundary and e 



f fa 



^fay\h\^^^^ where 
-1 or -1-1 whether z/" 
is timelike or spacelike respectively). As indicated above, the explicit form of the 
surface integral was first written down by York in his analysis of canonical gravity 
based on the adm decomposition of space-time [Yor72]. The first total divergence 
in (3.120) neatly cancels the § K surface term on the initial and final boundaries: 
We have indeed —2J'j^d'^Xy/^Vc{Kn^) = —2§d?x\/TiK. The second, which is 
orthogonal to the normal n", only contributes to the surface integral near infinity ^^ 

^^For noncompact geometries one must choose a reference background which is required to be 
a static solution to the field equations. The physical action is then the difi^erence between the 
original action and the action that is evaluated on this background (see [GH93, HH96a] for a 
foolproof analysis). 

^^The boundary near infinity, denoted by S°°, is foliated by a family of two-surfaces S^ coming 
from its intersection with Et. 
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and is therefore only relevant for noncompact geometries. This surface integral also 
contains the contribution from the ^K surface term in the action (3.117); explicitly 
it is 

^y"^V^(v,r^-reaj =^^^^(ff"' + nV)v,r,, (3.122) 

where r'^ denotes the unit normal to T,°° . Hence the action (3.117) takes the form 



Slh] 



/™[liX,^('-"« + ^--''-"-''1 - 8^ivW%- 



(3.123) 



where ^-^'K is the two-dimensional extrinsic curvature of Sf° in Sj. 
We thus take as the gravitational Lagrangian density 

£[N, N\ hij] = ^ ((3)i? + K,jK'^ - K^) (3.124) 

and introduce the canonical momenta 11, IIj, and tTadm conjugate to A^, A^*, and 
hij respectively, namely^^ 



TT 



n 



52 



S{dtN) 
5£ 

S{dthij) 



0, 



Vh 
16^ 



{K'^ -Kh'^). 



(3.125a) 
(3.125b) 
(3.125c) 



The action (3.123) can be brought to canonical form by means of the usual proce- 
dure, viz. 



S= At 



<'dm dthij -N%- N'-K, 






where we have introduced the following quantities 



'K = 167r Gill.; nVr^^.TT,, 



ijkl 

ADM |j' 



ADM ADM 



\fh 



(=')i?, 



(3.126) 



(3.127a) 
(3.127b) 



^^Since A'' and A''* are cyclic variables, they have no dynamics — their conjugate momenta vanish 
identically. They can be considered just as Lagrange multipliers rather than phase-space variables. 
(This feature of the gravitational Lagrangian can be traced back to the diffeomorphism-invariance 
of general relativity.) 
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and the so-called DeWitt metric, 

Gijki = ——7=\hikhji + hiihjk — hijhki), (3.127c) 

and where an additional surface term stemming from the Legendre transformation 
has been taken into account. The canonical Hamiltonian is thus 

H, = I (iV^K + iV^IKi) + ^f^ {N^A^ ^^^K + N, <,„rj) . (3.128) 

This expression diverges in general. However, once the Hamiltonian for the ref- 
erence background has been obtained, one can define the total energy of a given 
stationary solution of the field equations to be simply the value of the physical 
Hamiltonian, 



E= — 



1-J\n^\{^^k-^^k,)+n,^ 



adm' 3 



(3.129) 



where the subscript '0' refers to the background solution. This expression, firstly 
obtained by Hawking and Horowitz, generalises the famous ADM energy for asymp- 
totically flat space-times [HH96a]. 

Remark. As indicated above, surface terms play a crucial role for noncompact 
geometries — they cannot be discarded. Another difficulty which arises with sur- 
face terms has to do with spatially homogeneous cosmologies: genuine Lagrangian 
and Hamiltonian formulations are lacking for class B models (see [Mac75] and ref- 
erences therein). This is because the very symmetries that are imposed prevent 
the boundary terms to be vanishing; hence the equations derived from symmetry- 
preserving variations are the wrong equations. (We re-examine this issue in greater 
detail in Subsection 4.1.2.) 

Canonical quantisation — Wheeler-DeWitt equation. From equation (3.125) we 
see that there are primary constraints, H « and Hj ~ 0; their complete treatment 
was performed by Dirac, DeWitt, and others — this is discussed in several textbooks 
(see, e.g., [Hel80, Esp92]). 

Requiring the preservation in time of the primary constraints one finds as sec- 
ondary constraints the super-Hamiltonian, !H « 0, and super-momentum, Jii ~ 0. 
(The consistency algorithm does not lead to any new constraints.) For compact 
geometries the total Hamiltonian is thus given by 

Ht= f (n:K + N':Ki + ^H + fi%) , (3.130) 
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where fi and ^u* are Lagrange multipliers. All the constraints are first class: The 
super-Hamiltonian "K is responsible for the dynamics whereas the super-momenta 
"Ki are the generators of spatial coordinate transformations on St. Strictly speak- 
ing, the constraints "Ki ^ express that the state of the universe depends only 
on the intrinsic three-geometry of the spatial sections and not on the coordinates. 
(This leads naturally to the concept of Wheeler's superspace [Whe64].) 

In conformity with the canonical formulation it is clear that Einstein's equa- 
tions can be split into two sets: the dynamical equations and the initial-value 
equations. The former are the evolution equations for the dynamical variables hij 
and vTadm; these cannot be freely specified on a given hypersurface T,t owing to the 
constraints, which constitute the second set of initial-value equations. Both groups 
of equations are intimately connected. Actually, the initial- value equations contain 
all the dynamics of the gravitational field so that one could say that Einstein's 
equations are highly redundant. Solving the constraints and fixing the gauge — i.e. 
picking up a specific coordinate system — one removes four degrees of freedom out 
of the six pairs of canonical variables; hence the number of physical degrees of 
freedom is two, which correspond to the two helicity states of the spin-2 graviton. 

There are at least three distinct methods to proceed to the canonical quantisa- 
tion of general relativity: the ADM procedure in which all the constraints are solved 
and the gauge is fixed before quantising (reduced formalism); the Dirac modus 
operandi in which the constraints are imposed as restrictions on the quantum state 
of the universe; and an intermediate approach devised by Kuchaf [Kuc72]. In all 
these approaches the quantum state of the system is represented by a wave func- 
tional ^[/lij] — a functional on Wheeler's superspace. An important feature of this 
wave function, which appears when one considers closed cosmological models — the 
basic assumption of quantum cosmology; see [Hal91] — , is that it does not depend 
explicitly on the coordinate time label t (as a direct consequence of the vanishing 
of the super-Hamiltonian). 

According to the Dirac quantisation method the wave function is annihilated 
by the operator versions of the (first-class) classical constraints: The usual substi- 
tutions for the momenta having been made, viz. 

<BM - -^T^, (3.131) 



5hij ' 



one obtains the following equations for * 



14 



(5* 

!H/fc'I' = 2iDi— — = [momentum constraint], (3.132a) 



*We do not address here the operator-ordering issue. 
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^K* 



-WttG^ 



ijkl 



^ ^(3)^' 



5hij Shki 167r 



^ = [Wheeler-DeWitt equation]. (3.132b) 



The momentum constraint (3.132a) is nothing but the quantum mechanical ex- 
pression of the invariance of the theory under three-dimensional diffeomorphisms. 
The Wheeler-DeWitt equation (3.132b) is a second-order hyperbolic functional 
differential equation describing the dynamical evolution of the wave equation in 
superspace; it must be supplemented by appropriate boundary conditions. 

Remark. As an alternative to the canonical quantisation procedure one can con- 
struct the wave function using a path-integral approach (see, e.g., [Hal91, Haw93] 
and references therein). 



3.3.2 Hamiltonian formulation of nonlinear gravity theories 

Consider the Lagrangian density describing nonlinear theories of gravity, namely 



-gf{R), with/'VO, 



(3.133) 



where f[R) is a nonlinear arbitrary smooth function of the scalar curvature and 
primes denote differentiation with respect to the scalar curvature. 

We assume that the space-time is foliated into three-dimensional Cauchy hy- 
persurfaces Sj and we adopt the ADM basis (cf. 3 -|- 1-splitting, on page 89). On 
account of the explicit form (3.116) of the scalar curvature in terms of the intrin- 
sic geometry and extrinsic three-curvature tensor Kij on St and according to the 
definition (3.97) of Kij in terms of the Lie derivative of the three-metric hij along 
the normal to Sj, we may rewrite the Lagrangian density (3.133) as 



£ = NVhf(R{hij,Cnhij,Clhi, 



(3.134) 



Since the lapse function A'^ and shift vector A''* are not dynamical variables, we 
regard them merely as Lagrange multipliers associated with the gravitational con- 
straints. Furthermore we interpret the Lie derivative along the normal to St as a 
generalised notion of time differentiation.^^ We follow the generalised constrained 
Ostrogradsky construction developed in Subsection 3.2.4 in order to cast the theory 
into canonical form. 

Remark. Very recently, Ezawa et al. have performed a similar analysis of the nonlin- 
ear Lagrangian density (3.133) [EKK+98]. They obtain comparable results albeit 
utilising a slightly different Hamiltonian formulation: They intimately follow the 



^'To bear in mind this interpretation and to make the comparison with the general formalism 
easier we sometimes use a dot as an alternative notation for the Lie derivative. 



Qo,ij 


= hij, 




qi,ij 


= Ql,ij{hij,Cnhij) - 


= Kij, 


qi,ij 


— *^n^ij' 
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generalised Ostrogradsky method referred to as Buchbinder-Lyakhovich's method. 
Strictly speaking, they introduce velocities with the definition (3.84) (as indicated 
above, cf. the remark made on page 83, such an adjunction is unnecessary); they 
treat A'' and A'^* as genuine canonical variables; and they use the partial time deriva- 
tive dt instead of the generalised notion of time differentiation provided by the Lie 
derivative €„, along the normal to S^. In contrast with their approach the method 
presented here is more straightforward and clear-cut. 

In regard to the second-order Lagrangian density (3.134) the generalised Os- 
trogradsky prescriptions (3.75) entail the following natural choice for the auxiliary 
field variables: 

(3.135a) 
(3.135b) 
(3.135c) 

Owing to the definition (3.135b), conditions (3.76) are automatically fulfilled since 
equation (3.77) becomes 

^nh'ij = qo^ij = —2qi^ij, (3.136) 

which actually corresponds to equation (3.82). 

The extended Lagrangian density that includes the above Lagrangian constraint 
(3.136) supersedes the original Lagrangian density (3.134); it is given by 

^{qo,qi,qo,qi,^) = N-^2{qo,qi,qi) + {qo,ij + 2qi^ij) X'^ , (3.137) 

with suitable Lagrange multipliers A*-' so as to recover the interpretation in terms of 
the original set of field variables. (Observe that an overall N has been factorised.) 
The momenta canonically conjugate to the field variables are defined respec- 
tively by 

Po'=P'':=7r^ = A^^ (3-138a) 

^ - ^" - IS; ^ "''wk) - -2^0 nny (3.138b) 

n]"^' := ^ = 0. 3.138c 

Hence we have the primary constraints 

^^i = p^i - A^^' « 0, (3.139a) 

</'*^(go,gi,Pi)=P''^«0, (3.139b) 

li^^^ « 0, (3.139c) 
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where 'P^*-' is the traceless part of V^^ . The constraint (3.139b) arises from the 
indeterminacy of (^CnKij^ in the definition of momenta (3.138b). 

Now we perform a restricted Legendre transformation. Prom equations (3.116) 
and (3.138b) we obtain 

'^ - V'^CnKij = -'^(r- (% + ZKijK'^ -K'^ + 2N-^h'^Nu.) . (3.140) 



The last term in equation (3.140) gives rise to the surface integral at spatial infin- 

ityie 

-/ d^xivN^'' - Nr\'')rk, (3.141) 



3„„ 

which, in general, does not vanish for noncompact geometries. Henceforth we as- 
sume that we are dealing with actions that do not produce such nonzero surface 
terms. (We shall come back to this specific issue in the study of spatially homo- 
geneous cosmologies within the Hamiltonian framework, in Subsection 4.1.2.) The 
surface term (3.141) is the only one arising in the theory, in contradistinction to 
canonical general relativity, where the additional boundary term dn{vhK) must 
be cancelled out by the Gibbons-Hawking's compensating boundary term. 
The restricted Hamiltonian density is 

S)r{h, K,V) = V'^CnKij - iV-i£(i?) 

= ^ (^^^^R - 3K,,K'^ + K^^ - h'^Viij + V{r), (3.142) 

where the 'potential term' 

V{V) := Vh[Rf{R) - f{R)]^_^^^^^ (3.143) 

has been introduced — upon eliminating R for F{T'), the function that is obtained 
by solving the trace of equation (3.138b) for the trace of the momentum V^K 
The canonical Hamiltonian density is given by 

S}c{h,K,p,V) = ^r{h,K,V) - 2p'^Kij. (3.144) 

Therefore, the Hamiltonian form of the gravitational action based on the original 
Lagrangian density (3.133) can be written, up to boundary terms, as 

S= [ Ny^ C^hij + V'^ CnK,^ - Sjc{h,K,p,V)]. (3.145) 

Jm l -^ 



^^We keep the same notations as those used in the discussion on the boundary terms present 
in the gravitational action; pp. 94 fF. 
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The ensuing Dirac Hamiltonian density is 

^^ = N'K + N^%k + mv'' + ^''^'^if + iki(t>^'- (3.146) 

Preservation in time of the primary constraints (3.139) leads to the secondary 
constraints 

X^^ = 2p''^ + \VK'''^ « (3.147) 

and enables to determine the Lagrange multipliers Hij, u^^ , and ^ij. Aside from the 
usual super-Hamiltonian and super-momentum constraints, which are always first 
class, all other constraints are second class by virtue of the Poisson brackets 

{99^^(x),n2)(y)} = -5l6i5{^-y), (3.148a) 

{0*^(x), x''(y)} = jih^'h^' - h^'h^') <5(x - y). (3.148b) 

Therefore, we regard them as strong equations provided we introduce the appro- 
priate Dirac bracket. 

Expressing the Lie derivative in terms of the usual time derivative we recover 
the familiar form of the canonical action, namely 

S= f (f^ hij + V'^ kij -N%- N^%k) , (3.149) 

where the super-Hamiltonian and super-momentum constraints are given respec- 
tively by 



"K 



V 



6 V 



((% - -iKijK'^ + K^\ + V{V) - \h^3Vuj - 2p'^Kij, (3.150a) 



Wfc = jK\k - li'PKj)^ - 2hkp\y (3.150b) 

Counting the number of physical degrees of freedom is straightforward: Once 
the constraints have been enforced and some coordinate system has been chosen, 
there remains only three pairs of independent canonical variables. This is in total 
agreement with the particle content of the theory and with the fact that nonlinear 
theories of gravity are dynamically equivalent to scalar-tensor theories (cf. Sec- 
tion 2.3). We proceed now to a deeper analysis on this last property in the light 
of the Hamiltonian formalism. 

Dynamical equivalence with scalar-tensor theories 

Nothing whatsoever can prevent us from introducing a scalar field, $, as a new 
independent degree of freedom, by adopting, instead of the nonlinear Lagrangian 
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density (3.133), the equivalent constrained Lagrangian density 

£ = y^[/(^)+/'(^)(^-^)], (3.151) 

which is sometimes referred to as a Helmholtz Lagrangian density. 

According to the Ostrogradsky prescriptions, we define the auxihary variables 

Qo,ij = hij, go,* = ^5 

Ql,ij = Kij, (7o,<I> = £-n^, 

Ql,ij ^ L'Ti^ij^ 

and replace the original Lagrangian density (3.151) by the extended one, 

£(^0,91, ^0,91,^, A) =N-^^{qo,qi,qi,^) + (go,ii + 2gi,y) A*-''. (3.152) 

The conjugate momenta are given by (3.138) and IT^*) = 0. In addition to the 
primary constraints (3.139) we also obtain 

H = P + 6V^/($) «0, 

n(*) « 0. 

The Dirac Hamiltonian density is 

^D = i3c + mv^' + i^^'n^f + iki<^^' + CH + r?n(*), (3.153) 

where the canonical Hamiltonian density is given by 



iDe = ^'^^R - ^Ki,K'^ + ^K^ - \h'^V\i, + V{^) - 2p'^K,j, (3.154) 

with a 'potential term' 

y($) :=V^r$/($) -/($)!. (3.155) 

Time evolution of the primary constraints yields the determination of the Lagrange 
multipliers and produces the secondary constraint (3.147). All constraints are 
second class; we can readily eliminate the spurious degrees of freedom associated 
with A and $ from the action, bearing in mind that the function /'($) has to be 
inverted. The outcome of this procedure is precisely the canonical formalism that 
was developed for the Lagrangian density (3.133), where the variable V plays the 
role of the new independent scalar degree of freedom. 
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Extended gravity theories 

The Ostrogradsky method is also well suited for building up a canonical formalism 
of theories derived from Lagrangians that are functions not only of the scalar 
curvature R but also D^i? (such terms can be generated by quantum corrections 
to general relativity). The resulting theories, called generically extended gravity 
theories, have been studied in the context of inflationary cosmology; in particular, 
Wands has discussed their relationship with scalar-tensor theories [Wan94] (see also 
Section 2.3). 

Consider, for instance, the Lagrangian density that describes sixth-order grav- 
ity, namely 



g[R + -fRUR 



(3.156) 



which is dynamically equivalent to the scalar-tensor Lagrangian density (with two 
scalar fields) 



Ril + 7V31 + tDi^o) - TV'oV'i 



We may define the auxiliary Ostrogradsky variables 

0.0,13 = hi 

qi,ij 

and replace the original Lagrangian density (3.157) by the extended one 



(3.157) 



Ki 



C K 



(0) 

(0) 

.(0) 






h-^I^Z = R{(, 



^fO+7go" 



where we have used the useful formula 



.(0) 



(0) 






Uc, 



/i^'D.Dfe 



ClS^a^dc 



Then we may proceed in the same way as in the nonlinear j{R) case. (We do not 
elaborate further on this example since the procedure is systematic — although the 
ensuing expressions are very awkward.) 



3.3.3 Link between the Ostrogradsky and ADM formulations of 
Einstein's theory 

Ostrogradsky-ADM equivalence theorem 

Now that — as a direct application of the generalised constrained Ostrogradsky 
construction — we have achieved a consistent Hamiltonian formulation of nonlinear 
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theories of gravity, it is of great interest to examine whether this method could 
hkewise be used on the linear Lagrangian of general relativity. This prospect arose 
in connection with the problem of determining appropriate boundary conditions in 
a theory of gravity with higher derivatives. We have seen on page 102 that the only 
boundary term in the nonlinear case is a surface integral at spatial infinity. This can 
be easily understood since there is no need whatsoever to discard a total divergence 
that embodies second-order derivatives of the metric: The Ostrogradsky method 
is inherently appropriate to cope with higher derivatives. If one could treat — as 
we surmise — general relativity within the Ostrogradsky scheme, then exactly the 
same argument would apply: No boundary terms other than the analogue of the 
surface integral (3.141) would arise. What would then be the relationship between 
the Ostrogradsky formalism and the ADM version of canonical gravity? We intend, 
in this subsection, to unravel that possible connection. 

Our starting point is the Einstein-Hilbert Lagrangian density, 

£ = ,/^R. (3.158) 

We rely on what we did in the previous subsection since the linear Lagrangian 
density (3.158) may be viewed as a special instance of the nonlinear theory, provided 
we relax the condition on the second derivative of the function f{R) , that is /" = 0. 
Instead of utilising expression (3.120) of the scalar curvature as in the adm 
method, we exploit the formula (3.116). Thus we can formally rewrite the La- 
grangian density (3.158) as 

£ = NVhR{hij,Cnhij,Clhij). (3.159) 

Following the prescriptions of the generalised Ostrogradsky construction we intro- 
duce the auxiliary variables (3.135) together with the Lagrangian constraint (3.136) 
and we replace the original Lagrangian density (3.159) by the appropriate extended 
Lagrangian density with Lagrange multipliers A*-'. 

The momenta canonically conjugate to the field variables are given by equations 
(3.138), where f'{R) = 1, that is p^^ = A^^ V'^ = -2Vhh'^, and u\j^ = 0. We 
thus obtain the primary constraints 

if'^ = p'^ - yi ^ 0, (3.160a) 

^ij = pTiJ ^ 0, (3.160b) 

E = V + 6Vh^0, (3.160c) 

U[f « 0. (3.160d) 

(Note that the definition (3.138b) of the momenta p^ is now a primary constraint 
since f'(R) = 1.) 
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Performing a Legendre transformation we obtain the canonical Hamiltonian 

density, 

i3c(/i, K,p, V)=^ (^""^R - 3K,,K'^ + K^) - 2p'^K,„ (3.161) 

and a surface integral at spatial infinity, 

-2 / dakN^'', (3.162) 

which can also be derived straightly by setting V = —6\/h in the expression (3.141). 
The Dirac Hamiltonian density of the system is given by 

iDD = iDc + /u«/' + i^^'n^ii' + e«0'' + CS. (3.163) 

Preservation in time of the primary constraints (3.160) produces the secondary 
constraints 

X'^ = 2p'^ + \VK'^ « (3.164) 

and enables one to determine all the Lagrange multipliers. All constraints are 
second class; we can impose them as strong equations in the action, provided we 
define the appropriate Dirac bracket. We now demonstrate that the reduction 
process that consists in eliminating all the second-class constraints leads to the 
ADM form of the canonical action. 

Ostrogradsky— ADM equivalence theorem. The action of general relativity in 
the 'Ostrogradsky-Haniiltonian' form 

^osTRo = Jdt I n{p'^ CnK, + V'^ CnK^j - Sj,{h, K,p,V)'^ 

with the canonical Hamiltonian density given by equation (3.161), coincides exactly, 
after all the second-class constraints have been solved, with the adm action of 
general relativity (without the ad hoc compensating boundary term §K), 



ovr J goo \ 






(3.166) 



with the super-Hamiltonian and super-momentum defined by (3.127). 
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Prooi. Firstly, we expand the 'pg' terms in the Ostrogradsky action (3.165), mak- 
ing use of formulae (3.103) and (3.110), viz. 



N 



p'^ Cnhij + V'^ CnK.j = fi dth,, - Ip'm^j + V'^ dtK, 



VHN'^K,,j\,, + 2N\K,u). 



Solving the second-class constraints amounts to eliminating variables V^^ and Kij 
according to the strong equations 



Hence we obtain successively 

pii dthj + v'^ dtK,, = (V +ph'^)dthi, - 2dt{VhK) 



and 



^'Ludth,,-2dt{VhK) 



2p'^Ni\j - rHN'^K.jik + 2iV\i^,fc = 2{fmi\j + iV>|, 



2(p^^-iV,)|^. + 2iV,vr^^^l^.. 



Summing up the above results we have 



N 



p'^ Cnhi, + V' CnK.j = <^„„ dth,j - 2d4Vh K) 

-2(TrZ^Ni).. + 2Ny^ ,., 

V ADM '/ |j ' ADM |j' 



where we have utilised the identity 

2dt{VhK) - 2{p'm,)^^ ^ 2d„{VhK) + 2«,„iV,)|^.. 

The term — 2(7rADM-^i)| • yields the surface integral at spatial infinity in the ADM 
action (3.166), i.e. 

the term 2Ni-K^'' , . gives the adm super-momentum (3.127b). 

Now we examine the Ostrogradsky canonical Hamiltonian density in (3.165). 
We readily obtain from equation (3.161) the expression 



^^{h,K,p,V)^-Vh^'^R + ^[7r%^7rtr-l<u,, 



which obviously coincides with the ADM super-Hamiltonian (3.127b). 
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Boundary terms in the lignt oi the Ostrograasky approach 

The equivalence of the Ostrogradsky and adm canonical versions of the gravita- 
tional action of general relativity sheds a new light upon the question of boundary 
terms in the gravitational action. We can regard the Einstein-Hilbert action as a 
limiting case of the nonlinear gravitational action when f{Rj = R- In the latter 
case the only boundary term is the surface integral (3.141). The arguments that 
lead to the adjunction of the compensating term 2 jKdT, in the Einstein-Hilbert 
action come to naught in the higher-order case.^^ 

Firstly, there is no need whatsoever to eliminate the second-order derivatives 
of the metric. The Ostrogradsky construction of Subsection 3.3.2 shows that the 
configuration space is spanned by the induced metric and the trace of the extrinsic 
curvature. As a consequence, no compensating boundary terms are required to 
cancel surface integrals involving the latter quantity: The variation of K vanishes 
on the boundary dJA}^ 

Secondly, consider a transition from an initial configuration {h\fl,K^^>) on a 
hypersurface Si to a configuration {h\f^ , K^'^>) on an intermediate hypersurface S2, 
then followed by a transition to a final configuration {h/^^ , K^^>) on a hypersurface 
S3. As indicated above (on page 95), in a consistent path-integral formulation of 
the theory one expects that the amplitude to go from the initial to the final configu- 
ration should be obtained by integrating over all configurations on the intermediate 
hypersurface S2. This amounts to require 

S[9^^+9l'^]=S[9^^]+S[g^,% (3.167) 

where g\fj is the metric between Si and S2, which induces {^ab ^-^ab) °^ ^1 ^'^^ 
(/i|jj , K^"^') on S2; g\^ is the metric between S2 and S3, which induces {h\^ , K^"^') 
on S2 and {h^^j , K^^') on S3; and (7^^ -|- g^^^ is the metric obtained by joining 
together the two regions. In general relativity the three-metrics induced on the 
intermediate hypersurface S2 by g^^^ and (7^^ agree; but the extrinsic curvatures 
need not: this results in a (5-function in the Ricci curvature of g'^j, +5)^^ of strength 
given by the difference of the extrinsic curvatures on S2 [Haw79]. In the nonlinear 
case the fact that the respective traces of the extrinsic curvature tensors are one 
and the same on S2 is sufficient to remove the difficulty: This is fairly confirmed by 



^'^This was firstly argued by Horowitz in the context of Euclidean quantum gravity with a 
positive-definite action containing quadratic curvature terms [Hor85] (see also the similar work 
of Barth, where the Euler characteristic class for the manifold is taken into account [Bar85]). As 
this applies to the nonlinear case as well, we take up Horowitz's line of thought hereafter. 

^*In general relativity imposing that 5K be zero on the boundary is strictly forbidden since 
this would overdetermine the theory. 
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the explicit computation of the boundary variation of the nonlinear action [MB89], 
viz. 

5S\Q^ = 2<f f'6K. (3.168) 

It is not, in general, possible to write the right-hand side of equation (3.168) as 
the variation on the boundary of a functional, as is the case in general relativity; 
the only circumstances under which this programme can effectively been achieved 
are when the space-time manifold is maximally symmetric [MB89]. As a matter of 
fact, according to the discussion above on the crucial role of K, the right-hand side 
of equation (3.168) actually vanishes: The possible compensating term is reduced 
to naught. 

Furthermore, when passing from the nonlinear case to the linear Einstein- 
Hilbert action, one clearly sees how the boundary term 2<fKdT, materialises: The 
trace K becomes a spurious degree of freedom and thereby cannot be held fixed 
any longer on the boundary. 

3.3.4 Hamiltonian formulation of quadratic theories of gravity 

In the introduction of Section 3.2 on page 61 we have reviewed the works inspired 
by the Ostrogradsky method, with regard to the canonical formulation of higher- 
order theories of gravity. In addition, there are two major contributions dealing 
with quadratic theories of gravity that do not explicitly resort to the Ostrogradsky 
construction. Both of them require the reduction of the theory to a first-order 
form. To achieve this end Safko and Elston examined the connection between the 
Lagrange multiplier approach, thoroughly analysed in Section 2.2, and the ADM 
formalism [SE76]. They pointed out that there is a sharp difference between the 
constrained Palatini variational method and the Lagrange multiplier method that 
they used to cast higher-order curvature invariants into canonical form. Applying 
an 'Ostrogradsky-like' constrained formalism they succeeded — at least formally — 
in formulating a canonical version of the pure R^ theory. We are indebted to 
Boulware for the second contribution, in which he has undertaken the quantisation 
programme of quadratic gravity and has addressed a certain number of technical 
and physical issues arising when higher derivatives are present in a (gravitational) 
field theory [Bou84]. In contrast with Buchbinder and Lyakhovich's systematic 
approach (cf. page 63), Boulware's modus operandi seems quite heuristic at first 
glance. (However, in our opinion this is still the best reference with respect to 
canonical quadratic gravity.) One advantage of Boulware's formalism lies in its 
extensive use of the Lie derivative — instead of time differentiation — which appre- 
ciably simplifies the technicalities; this contrasts with Buchbinder and Lyakhovich's 
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formalism, which is — in that respect — quite difficult to decipher. 

In this subsection we rely on the generalised constrained Ostrogradsky scheme, 
which was thoroughly analysed in Subsection 3.2.4 and already applied successfully 
to nonlinear gravity theories in Subsection 3.3.2: We intend to cast the generic 
quadratic gravitational action into Hamiltonian form. 

Remark. The approach given here differs slightly from Boulware's as well as Buch- 
binder and Lyakhovich's treatments in the way we set it up. It is perhaps more 
transparent in regard to the Ostrogradsky method. The three methods lead to the 
same results, eventually. 

The generic quadratic theory. We consider the most general quadratic Lagrangian 
density in a four-dimensional space-time (M, gab) , 



£ = V^ (^A + ^R - -CabcdC^''" + ^R') , (3.169) 

where A is the cosmological constant, k^ = SttG^, and a, (3 are dimensionless 
coupling constants. 

Remark. Owing to the Gauss-Bonnet or Lanczos topological invariant in four di- 
mensions, we could equally have chosen jiRabR""^ + vR^ as the relevant quadratic 
Lorentz-invariant combination in the Lagrangian density, with ^ = —a/2 and 
v = (3/S + q/6. However — it is just a matter of convenience — ,the choice exhibited 
in equation (3.169) is more appropriate for discussing the particular variants of the 
general theory. 

As usual we assume that space-time is foliated into three-dimensional Cauchy 
hypersurfaces St and we adopt the ADM basis (cf. the 3 + 1-splitting on page 89). 
We recall here the expression (3.116) of the scalar curvature, viz. 

R = (% + K'^ - 3KijK'^ - 2N-'^h'^N\ij - 2h'^CnKij, (3.170) 

and we specialise formulae (3.114) to the ADM basis, viz. 



^ ijk 



S:S]Si - -h'\hk6'^ - h^5i)\ {K,,\t - Krtis), (3.171a) 

Cmni = I (<5f 4 - Ihi.h'^') {CnKki + N'^N^ki + KK^i + '^''^Rki) . (3.171b) 

Since the Weyl tensor identically vanishes in three dimensions, the quadratic 
Weyl invariant Cabcd.C'^^^'^ reduces to 

^abcd^ — 4^^0„jnjrO + ^nijk^ ), [6.L1Z) 

where the explicit forms ofCnijk and Cninj are displayed in equations (3.171) above. 
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As in the nonlinear case (cf. Subsection 3.3.2), we regard the Lagrangian density 
(3.169) as a functional of the three-metric and its successive Lie derivatives, namely 
£[hij,Cnhij,C'^hijy, we then introduce Ostrogradsky auxiliary variables according 
to the same prescriptions as given by equations (3.135); we replace the original 
Lagrangian density (3.169) by an extended one, i.e. £, and define the canonically 
conjugate momenta: 

p'J := -^=- = A*^ (3.173a) 



dCnh 



ij 



nSf:^^^a ,3.73c, 

At this stage note that the definition of V^^ differs from Boulware's choice, which 
requires that V^-' be zero at flat space. This option is adopted by Boulware because 
the linear term and the quadratic terms in the departing action are not treated on 
an equal footing in his formalism: The former is handled as in the ADM canonical 
version of general relativity whereas the latter are reduced to first order by means 
of a generalised Legendre transformation. In the formalism presented here — by 
contrast — we refrain to make such a segregation for we have shown previously that 
the generalised Ostrogradsky construction may be consistently applied to the adm 
action (cf. Subsection 3.3.3). 

The trace and traceless parts of V^^ are respectively 

V = -^^{k-'^ + ^R), V^'^ = -2aVhC'''''^. (3.174) 

The sole primary constraints of the general quadratic theory are (/?*■' ^ and 
n^-- « 0. Therefore, in agreement with the particle content of the theory, there 
are eight physical degrees of freedom. ^^ 

The next step in the Ostrogradsky construction consists in performing a gen- 
eralised Legendre transformation. Here we want to solve the 'velocities' CnKij 
for the canonical variables and their conjugate momenta. We achieve this goal by 
firstly solving equations (3.170) and (3.171b) for R and (7"*"-^ respectively, and then 
expressing these latter quantities in terms of V and P^*-' respectively, with the help 



^^The number of physical degrees of freedom is determined by the equation A^phys. ~ K — Ni — 
|A^2, where K is the total number of pairs of canonical variables, A''i and N2 are the number of 



first-class and second-class constraints respectively; see [HT92, p. 29]. 
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of formulse (3.174), thereby obtaining 



C K 



-pT^iJ 1 



a 



^/h 3/3 



-2 



+ :77^ K + 



V 
3Vh 



N, 



\'ij 



N 



Kij 



V 



KKijV' + ^ ((3)i? + k2 _ K^^K^]■^ . 



(3.175) 



Hence the restricted Hamiltonian density is 

^r {h, K, V) = V'CnK.j - iV-l£ 



V 



= Vh{aCm,kC'''''' + V{V) - A) + -{^""^R + K'- K^jK'^) 

'pTij'pT 

where we have defined the 'potential term' 

The canonical Hamiltonian density is thus given by 



(3.176) 



(3.177) 



(3.178) 



Therefore, the Hamiltonian form of the gravitational action based on the original 

Lagrangian density (3.169) may be written as 



S 



dt 



St 



N (p'^ Cnhj + V'^ CnKij - i3c {h, K, p, V 



fjNV%-N,V^^)r, 



(3.179) 



where the canonical Hamiltonian density is given by 

i3c(/i, K,p, V) = -2f^K^j + Vh{aCm,kC'''''' + V{V) - A) 



(3.180) 



Expanding the Lie derivatives in the action (3.179) and integrating by parts we 
obtain the final form of the quadratic canonical action, where we have retained all 
the surface terms, viz. 



S 



dt 



N(p'^ dthij + V'^ dtKij -N%- N^^Kk 



500 L 



NV^": . - N\jV^'' + 2Nj {p}^ + V'^K\) 



rk 



(3.181) 
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where the super-Hamiltonian "K constraint coincides with the canonical Hamilto- 
nian density (3.180) and the super-momentum constraint is given explicitly by 

JCfc = V'^Kij\k - 2{V^Kik) 1^. - 2Kkp'\y (3.182) 

Remark. Since the constraints are local functions of the field quantities at a par- 
ticular point of space, the Poisson bracket of two such objects is a sum of delta 
functions and derivatives of delta functions. In many instances it is simpler to inte- 
grate the constraints over the spatial sections with a test function: The constraint 
(/9(x) « becomes 

and it must be functionally differentiated with respect to the canonical variables 
so that the Poisson bracket of two such (integrated) constraints be given by 

|99a[V'],¥'b[V'']} := 

I ()in-D 1 1// 1 f)in A \ih\ ^cn-D 1 1// 1 / \ 

(3.183) 



d-\ 



6ipA [lp] SipB [ll^'] Sip A [lp] SifB [lp'] _ 

5hki JtF~ 6Kki 6V^^ * '^^ ' "^^ 



where the canonical variables are varied freely, i.e. independently of the constraints. 
The Poisson brackets involving the super-momentum constraint are easily cal- 
culated and understood, for !Kfc is the generator of spatial diffeomorphisms. For 
instance, a direct application of the definition (3.183) yields 

|G[i/'],/ifci(x)| = -V'(fc|o(x), 
where we have defined the integrated super-momentum as 



G[^P] :=y"d3x:>{„(x)^'"(x) 



On the other hand, it is very difficult to compute the brackets involving the super- 
Hamiltonian constraint. (Such a tedious task was carried out by Boulware; see 
[Bou84, Appendix B].) 

We now look at the possible simplifications that would arise if we specialised the 
generic quadratic action (3.169). Amongst the five distinct variants of the general 
theory, which were analysed by Buchbinder and Lyakhovich [BL87], only one turns 
out to be relevant for our purpose here, namely the conformally invariant theory 
(/3 = 0, A = = K~^). Strictly speaking, the case a = — and in particular the 
pure R theory — reduces to studying gravitational actions of the nonlinear type 
(cf. Subsection 3.3.2) whereas the cases corresponding to the vanishing of the sole 
coupling constant P may be set aside if one invokes the local conformal symmetry 
ab initio. 
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Conformally invariant theory. In the conformally invariant case the Lagrangian 
density is 

£ = -aNVh{2C^,r.jC^'^' + C„i,fcC«^^'=) (3.184) 

owing to the identity (3.172). The only alteration in the conjugate momenta (3.173) 
involves V^^ , which becomes 

V'^ = -2aV^C"^"^'. (3.185) 

Now, in addition to (/J*-' f» and n^- « 0, we obtain the primary constraint 

</, = hijV'^ = P « 0, (3.186) 

which arises because only the traceless part of the 'velocities', i.e. [CnKij^ , can 
be extracted from equation (3.171b). Performing as usual a Legendre transfor- 
mation on the appropriate extended Lagrangian density we obtain the canonical 
Hamiltonian density 

■pTij-pT 

9)c{h, K,p,V) = - 2f^K,, + aVhC^.jkC'''''' - -^—if 

- T^'^j - T""'' {^'% + KKJ^). (3.187) 

Therefore, we may cast the conformally invariant action into canonical form, that 
is, up to boundary terms. 



S= / N 
'm 



(3.188) 



p'^ Cnh^j + V' CnKij -^,{h,K,p,r) 
The associated Dirac Hamiltonian density is 

i3D = iOc + ^h/' + /^^Ht^ + ^'A- (3.189) 

Consistency of the primary constraints (3.186) when time evolution is considered 
yields the determination of the multipliers ^ki and v^'' and gives rise to the sec- 
ondary constraint 

X = 2p + Klir'"''^0. (3.190) 

According to the remark made on page 114, we may utilise test functions in order to 
simplify the computation of Poisson brackets. After tedious calculation we obtain 

{g[V'],x(x)}oc(^-x)|„(x)«0, {h[C],x(x)}ocC(x)^c(x)«0, 
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where we have defined 



H[C] :=y"d3xi^c(x)C(x). 



Hence the constraint (3.190) is the unique secondary constraint stemming from the 
consistency algorithm. Furthermore it is first class — it is indeed the generator of 
conformal transformations — by virtue of the Poisson bracket 



0(x),x(y)}=2P<5(x-y)«O. 



Therefore, the number of physical degrees of freedom is six. This is in total agree- 
ment with the particle content of the linearised theory. 

To proceed further we need to impose a gauge-fixing condition; we shall see in 
Chapter 4 how this can be realised, in the specific case of spatially homogeneous 
cosmologies, so that the trace of variables Kij and V^^ be removed from the set of 
canonical variables and the conformal constraint (3.190) be automatically satisfied. 
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Chapter 4 

Higher-order spatially 
homogeneous cosmologies 



"This cosmos, which is the same for all, has not been 
made by any god or man, but it always has been, is, 
and will be, an ever-living fire, kindling itself by regular 
measures and going out by regular measures. " 

— Heraclitus of Ephesus. 

THE simplest class of space-times that give physically reasonable cosmologi- 
cal models are the famed Priedmann-Lemaitre-Robertson-Walker (flrw) 
spaces, which are isotropic and are homogeneous on spacelike sections. It 
is conventional wisdom that FLRW models describe faithfully the large-scale prop- 
erties of our universe. They indeed successfully account for many of its observed 
features and constitute the basic pillar of the standard model of cosmology, which 
is not yet free from certain riddles: the so-called horizon, 'flatness', and 'smooth- 
ness' problems. These are addressed, more or less adequately, by incorporating 
the inflationary paradigm into the 'bare old-fashioned' isotropic cosmology and by 
invoking the existence of hot and cold dark matter blends. Be that as it may, 
mathematical cosmology cannot be satisfied with those models for their degree of 
simplicity, though it guarantees that one could easily find closed- form solutions, 
also implies that deeper investigations on specific issues — such as the asymptotic 
approach towards the initial singularity and the related question of the genericness 
of oscillatory, chaotic dynamical regimes — are screened by the stringent symmetry 
requirement that the universe be isotropic ab initio. Even though the observed 
universe seems to be highly isotropic, there may have been large anisotropics at an 
earlier epoch; in that respect, it is of great interest to understand why the actual 
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universe is so much less anisotropic than Einstein's field equations allow it to be in 
principle. 

One possible answer is to consider, as Penrose surmises, that the 'initial condi- 
tions' logically entail a vanishing Weyl tensor to ensure an isotropic universe from 
its very inception; another attitude towards that problem, which we adopt below, 
proceeds through generalising the FLRW models by dropping the isotropy hypoth- 
esis while keeping spatial homogeneity.^ In particular, this endows anisotropic 
cosmologies with a nonzero Weyl tensor, the existence of which may alter the 
character of the singularities that typically arise in cosmology (see, e.g., [CE79]). 
Moreover, the field equations remain ordinary differential equations since only time 
variations are nontrivial. 

In Section 4.1 we give a short account of spatially homogeneous cosmologies: 
Firstly, we summarise the geometrical setting that enables one to define and classify 
the spatially homogeneous anisotropic Bianchi cosmological models (for a very 
good synthesis the reader is referred to [ESW97]); then we discuss briefly the 
specific problem of nonvanishing surface terms in the Lagrangian and Hamiltonian 
formulations of Bianchi cosmologies both in general relativity and higher-order 
theories of gravity. 

In Section 4.2 we analyse the behaviour of the Bianchi-type IX model in the 
pure general quadratic theory of gravity on approach to the initial singularity in 
a four- dimensional space-time. This work was carried out in collaboration with S. 
Cotsakis, J. Demaret, and Y. De Rop [CDDQ93]. (We focus on the analytical treat- 
ment, leaving aside the details of the numerical analysis.) The — empty — Bianchi- 
type IX or mixmaster model is of fundamental importance in mathematical cos- 
mology for it is the most general spatially homogeneous model and it contains the 
closed FLRW model as a special case. Its major feature is its — chaotic — oscillatory 
behaviour when one reaches the initial singularity. The question that is addressed 
in this respect is whether this behaviour is altered when one considers more general 
settings than general relativity such as, for instance, purely quadratic theories of 
gravity. In this framework we show that the mixmaster universe possesses a non- 
chaotic solution which is stable and we prove that the so-called BKL approximation 
scheme is structurally unstable. 

In Section 4.3 we make use of the general Hamiltonian formalism that we have 
developed in Subsection 3.3.4 (p. 110 ff.) to study the specific case of spatially ho- 



^As isotropy at each point entails spatial homogeneity, anisotropic cosmologies constitute the 
first class of cosmological models that are (fairly) more general than the flrw models; they 
have been extensively studied for a couple of decades. By contrast, it is only very recently that 
inhomogeneous cosmological models have regained a genuine interest amongst researchers; see 
Krasihsky's 'encyclopaedia' for a recent and thorough account [Kra97]. 
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mogeneous cosmologies. Firstly, we adapt the Hamiltonian formalism to Misner's 
parameterisation; for this purpose we define a canonical transformation the virtue 
of which is to disentangle terms stemming respectively from the pure i?-squared 
and Weyl-squared variants of the general quadratic theory. This is helpful since 
one is able to treat those cases separately, thereby simplifying the analysis. In the 
first variant (i?-squared) we derive — for those Bianchi types admitting a canonical 
formulation — the super-Hamiltonian constraint, the reduced Hamiltonian density, 
and the corresponding canonical equations, which constitute an autonomous differ- 
ential system; in particular, we solve the latter analytically for Bianchi type I and 
thereafter compare our results with those found in the literature; for the Bianchi- 
type IX model we reduce the (first-order) canonical equations to three coupled 
second-order differential equations for the physical degrees of freedom; finally, we 
extend the discussion to FLRW models. This work was first undertaken during 
my MSc [Que92] and was carried out with J. Demaret [DQ95]. In the second 
variant (Weyl-squared), referred to as conformal gravity, we consider the simplest 
spatially homogeneous space-time that exhibits nontrivial physical degrees of free- 
dom, namely the Bianchi-type I model (the isotropic FLRW space-times are indeed 
conformally flat). We derive the explicit forms of the super-Hamiltonian and the 
constraint expressing the conformal invariance of the theory and we write down 
the system of canonical equations. To seek out exact solutions to this system we 
add extra constraints on the canonical variables and we go through a global involu- 
tion algorithm which eventually leads to the closure of the constraint algebra. The 
Painleve approach provides us with a proof of non-integrability, as a consequence 
of the presence of movable logarithms in the general solution of the problem.^ We 
extract all possible particular solutions that may be written in closed analytical 
form. This enables one to demonstrate that the global involution algorithm has 
proven to be exhaustive in the search for exact solutions. Finally, we discuss the 
conformal relationship or absence thereof of our solutions with Einstein spaces. 
More specifically we show that the necessary condition for a Bianchi-type I space 
to be conformal to an Einstein space, obtained from purely geometrical consider- 
ations, becomes — in conformal gravity — a sufficient condition as well; this enables 
us to determine which solution amongst the whole set of exact solutions is not 
conformal to an Einstein space. This work was carried out in collaboration with J. 
Demaret and C. Scheen [DQS98] (see also [Que98]). 



^I am indebted to C. Scheen for providing me with the material on the analytic structure of 
the Bianchi-type I model in conformal gravity (p. 169 fF.), which is partly contained in [DQS98]. 
For a thorough investigation on methods that probe the analytic structure of differential systems 
and their application to relativistic cosmology, the interested reader may consult Scheen's PhD 
thesis [Sch99]. 



128 Chapter 4. Higher-order spatially homogeneous cosmologies 

4.1 Spatially homogeneous cosmologies 

4.1.1 Geometrical setting 

A four-dimensional space-time (M, gab) is said to be spatially homogeneous if it 
possesses an r-dimensional isometry group G^^-* (invariance group of the metric 
gab) that acts transitively on a one-parameter family of spacelike hyper surf aces, 
the orbits of the group^ (so that r ^ 3), which provides a natural slicing of the 
space-time. At any point q on any such hypersurface S there are (at least) three 
nonzero linearly independent Killing vector fields tangent to S. Since a Killing 
vector field ^" is completely determined by the values of ^" and VaCb at any point 
q of S, there can be at most r^^x = ^n{n + 1) linearly independent Killing vector 
fields on a manifold of dimension n (see, e.g., [Wal84, Appendix C.3]); here, this 
means 3 ^ r ^ 6. When r = r-^^ax, the Riemannian space has constant curvature. 
Furthermore, a theorem due to Fubini states that a Riemannian space (for n > 2) 
cannot admit an isometry group of dimension r = ^n{n + 1) — 1 (see, e.g., [Eis66]); 
here, this implies that the only possible values of r are 3, 4, and 6. If r = 6, 
the space-time is not only spatially homogeneous but also spatially isotropic, that 
is, locally spherically symmetric, and belongs to the FLRW class, which features 
maximally symmetric spacelike sections, i.e. sections of constant curvature.^ If 
r = 4, the space-time is locally rotationally symmetric (lrs); there exists a three- 

(3) 

parameter subgroup G that acts either simply or multiply transitively on the 
spacelike hyper surf aces: The latter case includes the Kantowski-Sachs cosmological 
models (the orbits of the group are two-dimensional, maximally symmetric, and 
with positive constant curvature); the former corresponds to LRS Bianchi models 
and thus points to the last possible value of r, namely r = 3, which corresponds to 
the Bianchi class. Thus, if we set aside the Kantowski-Sachs models, we are left 
with isometry groups that act simply transitively on the spacelike hypersurfaces S, 
i.e. such that dimG = dimS = 3. Hence, in the light of the preceding discussion, we 
can state a more precise definition of Bianchi cosmologies: A Bianchi cosmology 
is a model the metric of which admits a three-dimensional group of isometrics 
that acts simply transitively on spacelike hypersurfaces S, which are surfaces of 
homogeneity in space-time. The complete list of Bianchi cosmological models is 
obtained by classifying the three-parameter isometry groups, that is, the three- 
dimensional real Lie algebras generated by the associated Killing vector fields; the 
outcome is nine nonequivalent Bianchi types — or ten equivalence classes — , named 



^The orbit of a point p under a group G'*^' is the set of points into which p is moved by the 
action of all elements of the group. 

■^Note that in the case of space-times (n = 4), the ten-parameter Poincare isometry group of 
flat space is an example of G'^"'; Minkowski and de Sitter space-times are maximally symmetric. 
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Table 4.1: The Bianchi types for class A. 
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+1 
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according to Bianchi's own terminology. Let us briefly sketch the classification 
procedure according to Wahlquist and Behr, Ellis and MacCallum, and Siklos.^ 

Suppose Xi for i = 1, 2, 3 form a basis of a three-dimensional Lie algebra with 
structure constants C\,, which are defined through the commutators [Xi,X, 



JJ 



C\jXk (they are antisymmetric and satisfy the Jacobi identity). Given C\p we 
can define a three- vector a* and a symmetric 3x3 matrix n*-' by aj := 2^\i ^^^ 
^ij ._ g«fc«^i(^j^^_^j^^-j respectively, where e^^^ is the unique totally antisymmetric 
tensor satisfying e^^^eijk = 3! = 6. It is easy to show that the structure constants 
can be written as C\, = n^^euj + 25^aj]. Substitution of this expression into 
the Jacobi identity yields the simple result ri^^aj = 0. According to Ellis and 
MacCallum, the classification now gives two broad classes: 'class A' (oj = 0) and 
'class B' (aj 7^ 0); the resulting Lie algebras are divided into several types according 
to the rank and the (modulus of the) signature of n*-'. In class A there exist precisely 
six distinct Lie algebras whereas in class B there are only four possible values for 
the rank and the signature of v}K Some types (VI and VII) can be subclassified 
with the help of a further invariant h, which can be determined by the formula 
OjOj = ^hn^^n^'^ersi^kij- The resulting algebraic classification is summarised in 
Table 4.1 and Table 4.2 (n^ for i = 1,2,3 denotes the diagonal elements of the 
symmetric matrix n*-' ; a is the nonzero component of the vector a, obtained after 
a suitable rotation of axis and rescaling; the number p refers to the dimension of 
the orbits of the group — it is related to the automorphism degrees of freedom — ; 
and the number q describes the degree of generality of the most general vacuum 
solution of each group type.) This completes the Bianchi classification. 

Given the classification, one can seek for each Bianchi type the corresponding 
three-manifold endowed with a metric and isometry group of that type; this is 



'For thorough reviews on the Bianchi-Behr classification and related questions see, e.g. 
[Mac73, Mac79a, Mac79b, Jan84, BS86]. 
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Table 4.2: The Bianchi types for class B. 
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Table 4.3: Basis one-forms in Bianchi types I, II, V, and IX. 



Type 



w' 



I II V IX 



dx dx — zdy dx cos y cos zdx — sin zdy 
dy dy e^dy cos y sin zdx + cos zdy 

dz dz e^dz —sinydx + dz 



achieved by choosing time and spatial congruences, and with the help of invariant 
one-forms (as well as the automorphism degrees of freedom in order to simplify the 
spatial metric [SikSO, Sik84, RE85]). We follow here the metric approach rather 
than the orthonormal frame approach (see [ESW97]). The spatially homogeneous 
metrics can be written as ds^ = — dr^ -|- hij{T)uJ^LO^ or, in terms of an arbitrary 
time variable, as ds^ = —N'^{t)dt'^ + hij{t)uj'^uj^ , where N is the lapse function and 
w* for i = 1, 2, 3 are the basis dual one-forms, which satisfy (owing to Cartan's first 
structure equations) the relation dw* = ^C^f^-uj^ f\u)^. The explicit formulae for 
types I, II, V, and IX are summed up in Table 4.3. 

4.1.2 Hamiltonian cosmology 

Besides its interest with regard to quantisation, the Hamiltonian formulation of 
general relativity has also been applied to a large variety of cosmological problems — 
both classical and quantum. One can trace back this kind of investigation to 
DeWitt who specialised the canonical formalism to the closed FLRW model, which 
was viewed as a toy-model for quantum gravity [DeW67]. Independently, Misner 
laid the foundations of Hamiltonian cosmology [Mis69a, Mis69b], that is, the study 
of cosmological models by means of equations of motion in Hamiltonian form — to 
be more specific, through the ADM reduction procedure and with the introduction 
of the fruitful concept of minisuperspace [Mis72]. 
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Typically, the advantage of the Hamiltonian treatment in general relativity is 
twofold. Firstly, it enables one to give an heuristic analysis of the generic asymp- 
totic behaviour of Bianchi models near the singularity (and at late times) without 
doing lengthy calculations: The analysis is reduced to the qualitative description of 
the motion of a point in a plane under the influence of a time-dependent potential 
[Rya72, Ugg97]. This type of investigation constitutes the qualitative Hamilto- 
nian cosmology. Secondly, the Hamiltonian formulation enables one to produce a 
Hamiltonian in a certain 'Lagrangian canonical' form that reveals the mathematical 
similarities amongst different 'hypersurface-homogeneous' models [UJR95]. 

As regards the scope of this thesis, such qualitative treatment of Bianchi dy- 
namics is quite difficult to adopt for we are dealing with higher-order theories, 
having more degrees of freedom than general relativity; hence the aforementioned 
qualitative analysis by means of simple potential diagrams breaks down. Notwith- 
standing this hindrance we would like to examine whether the Hamiltonian methods 
might produce some simplification of the intricacies that occur when one deals with 
higher-order field equations. The first aspect we want to discuss has to do with 
the variational principle as applied to spatially homogeneous cosmological models 
and, more specifically, to Bianchi cosmologies. 



Variational principle and boundary terms. 



In Section 2.1 we have given a short account of the metric variational principle 
in general relativity, without any reference whatsoever to spatially homogeneous 
cosmologies. Now if one tries to derive the field equations of the various Bianchi 
types by means of the Hilbert variational principle, then in general (as was firstly 
noticed by Hawking [Haw69], then confirmed by MacCallum and Taub [MT72]) one 
will not obtain the correct field equations. It was firstly claimed — erroneously — 
that the origin of the trouble laid in the utilisation of non-coordinate frames to 
perform the calculations [Rya74].^ But the true reason was given by Sneddon 
[Sne76]: The requirement of spatial homogeneity prevents a boundary term being 
set equal to zero.^ However, it turns out that for class A Bianchi models this 
boundary term identically vanishes. More specifically, if the metric is assumed 
to be spatially homogeneous, then the spatial integration in the ADM form of the 
Einstein-Hilbert action (3.166) can be performed to give 

5.BM = [dt [<,„ dth, -N:K- N':K^ - 2(7rf,„iV, - i7r,,„iV'=) | J . (4.1) 



''This is still advocated by Ryan [RW84]. 
'^See also the remark on page 98. 
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On account of the formula given on page 6 the spatial divergence in the action 
(4.1) reduces to the expression 

{<iuN^ - h^..uN')^^ = {nt^N, - ivr,,„iV'=)C^.,. (4.2) 

Hence, unless the trace of the structure coefficients vanishes — the condition C"'.^ = 
is precisely the characterisation of class A Bianchi models — , the variation of 
the ADM action with respect to the shift vector produces additional unwanted 
terms and results in a wrong super-momentum constraint. Although most class 
B models do not admit a Hamiltonian formulation, some particular nondiagonal 
and all diagonal^ models do: This can be achieved whenever the super-momentum 
constraint is holonomic, i.e. can be expressed as the vanishing of a total derivative; 
then it can be integrated and used to reduce the number of gravitational degrees 
of freedom [UJR95]. 

Since at the end of Chapter 3 we have developed a Hamiltonian formulation of 
the generic quadratic gravity theory it is natural to address the question of under 
which circumstances the spatial divergences occurring in the action vanish so as to 
warrant a well-posed variational principle. Consider the quadratic action (3.181) 
in canonical form and focus on the surface integral; assuming that the metric be 
spatially homogeneous and that spatial integration have been performed we readily 
identify the corresponding spatial divergence, namely 

NV^''y + 2Nj{p^^ + V''''K{)] . (4.3) 

Prom mere inspection it is obvious that the variation of this term will not vanish 
unless one considers class A models: the expression (4.3) is indeed proportional 
to the trace of the structure coefficients. Therefore, we end up with the same 
conclusion as in general relativity: All class A models do admit a Hamiltonian 
formulation. This result also holds for a nonlinear gravitational action, for the 
spatial divergence (4.3) becomes in that case 

Nj{p''' + VK^^)] (4.4) 

and is proportional to the trace of the structure coefficients as well. 

Remark. Whereas the super-momentum constraint "Kk is automatically satisfied 
for all diagonal class A models in general relativity, this is not necessarily the 
case in higher-order gravity, for the expression (3.182) of 'Kk is not as simple as 
in general relativity. As a matter of fact the first term in that expression is not 

*For example, Sneddon has contrived a method to "heal" the variational principle in the specific 
case of the diagonal Bianchi- type V model [Sne76]. 
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of the form 21*-',., where SI*-' is a tensor density; a direct calculation shows it is 
zero for all diagonal class A Bianchi types but type VIq in the specific conformally 
invariant case.^ This means that, unless one considers the latter situation, one can 
always safely choose the shift vector to be zero, which is the usual assumption in 
Hamiltonian cosmology. By contrast, in the aforementioned peculiar case one must 
check that this very choice does not break the equivalence with the field equations. 

4.2 The mixmaster universe in fourth-order gravity 

4.2.1 On the mixmaster chaotic dynamics 

The term 'mixmaster' refers to the Bianchi-type IX cosmological model in vac- 
uum [Mis69a]; it suggests the nice features of the Bianchi-type IX dynamics, i.e. 
the oscillatory regimes on approach towards the initial singularity (see [LL89]). 
Recently, this terminology has also been used to discriminate amongst asymptot- 
ically velocity dominated cosmological models (where the spatial curvature terms 
in the Hamiltonian constraint become negligible as compared to the square of the 
expansion rate as the singularity is approached; this corresponds to a Kasner-like 
behaviour) and oscillatory-like cosmologies the prototype of which is Bianchi type 
IX (and type VIII), which is of fundamental importance with regard to the investi- 
gations that address the issue of the genericness of chaos in relativistic cosmology 
on approach towards the initial singularity and of the nature of that singularity 
when spatial homogeneity is relaxed — cf. the Belinskii-Khalatnikov-Lifshitz (bkl) 
conjecture — (see [Ber98], and references therein). Interest in the mixmaster dynam- 
ics has increased dramatically in the last fifteen years (see the review in [CLL98]); 
controversies have arisen on the chaotic nature of the Bianchi-type IX model and in 
particular on the problem of (diffeomorphism-)invariant characterisation of chaos 
in relativistic systems (see [HBC94]). 

That the mixmaster dynamics be chaotic is now fairly well confirmed — though 
it has not yet reached the unambiguous status of a theorem — by means of different 
methodologies: qualitative methods, numerical techniques, and analytical tools. 
Qualitative methods embody: the BKL piecewise approximation methods [BKL70, 
KP72]; Hamiltonian methods (Misner, Ryan, and others) [Rya72, RS75, Ugg97]; 
and dynamical systems methods [WE97]. The BKL approach (see, e.g., [LL89]) has 
shown that the evolution in time of the mixmaster model can be approximated as a 
sequence of time periods (Kasner epochs and eras) during which certain terms in the 
field equations may be neglected, thereby leading to a description of the dynamics 



^Making use of the variables defined on page 147 we indeed obtain Jfi = = K2 and M3 

-2^{P+K-+P-K+) ^ for type VIo. 
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in terms of the Bianchi-types I and II models, and 'bounce laws' from one Kasner 
era to the other that are sensitive to initial conditions [Bar82, CB83]. Hamiltonian 
cosmology (see Subsection 4.1.2) has reduced the analysis of the field equations to 
that of a time-dependent Hamiltonian system in two dimensions for a particle that 
bounces on moving 'potential walls', which approximate the time-dependent poten- 
tials characterising the various Bianchi types. The dynamical systems approach is 
based on the fact that Einstein's field equations for spatially homogeneous cosmolo- 
gies can be written as an autonomous system of first-order differential equations, 
thereby defining a dynamical system; it was initiated by Collins [CS71, Col71] and 
developed extensively by Bogoyavlensky [Bog85], Wainwright [WH89], and others. 
Historically, numerical techniques have proven to be misleading, but the associated 
controversies have been smoothed away. More reliable algorithms are now used, 
mainly to probe the structure of the singularity in inhomogeneous space-times 
[Ber98]. Most recently, Cornish and Levin gave a very strong indication point- 
ing towards the chaotic character of the mixmaster model, by exploiting fractal 
methods [CL97a, CL97b].^° Analytical tools tackle the 'chaoticity issue' in terms 
of integrability concepts — integrability of differential systems, which is inherently 
encoded into their analytic structure. The Painleve method (see p. 169 ff.), which 
provides such an analytical tool, rests on the Painleve property — a differential sys- 
tem possesses this property if and only if its general solution is uniformisable or, 
equivalently, exhibits no movable critical singularities. Integrable systems are de- 
fined in the sense of Painleve as those systems that possess the Painleve property. 
The trivial part of the Painleve method, known as the Painleve test, produces nec- 
essary but not sufficient conditions for a system to enjoy the Painleve property and 
requires local single- valuedness of the general solution in a vicinity of all possible 
families of movable singularities (see, e.g., [RGB89, Con94, Con99]). Notwith- 
standing the fact that it is not conclusive — especially for real-time chaos — , the 
Painleve method has proven to be helpful in understanding the intricate structure 
of systems such as the mixmaster model [LMC94]; even more, the fruitful inter- 
play of the Painleve method and complex-time numerical integrations has been 
exemplified very recently [SD97, Sch99]. 

It has also been demonstrated that the mixmaster chaotic behaviour, which 
appears in a four- dimensional space-time in general relativity, is generically sus- 
tained as one increases the space-time dimensionality up to ten, but disappears 
in any universe with space-time dimension greater than ten [DHS85, DHH"'"86, 
HJS87, EH87, DDH88]. So, in the context of general relativity, the mixmaster 



^°One must bear in mind, however, that the fractal structure is obtained numerically: One 
must still be cautious as regards the conclusions of that analysis, even if the whole construction 
is very interesting conceptually. 
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evolutionary picture is dimensionally dependent. 

In the next subsection we address the question of whether this picture is sensibly 
modified when one considers more general frameworks — in particular, higher-order 
theories of gravity. One possible scope of this investigation would be to determine 
how general the features of chaotic evolution met in general relativity are in the 
framework of all possible physically interesting theories of gravity. This programme 
also hopes to shed new light into the cosmological structure of gravity theories with 
higher derivatives: One possibility would be, for instance, that nice nonchaotic 
properties emerge near the space-time singularity, as is the case for the mixmaster 
model in a theory described by a Lagrangian of the type L = R + ^iB? [BC89, 
Cot90] or by a scale-invariant Lagrangian such as L = i?^ [BS89, SZ93]. Here 
we examine the structure of the Bianchi-type IX cosmological model in the pure 
quadratic theory of gravity, i.e. the theory that is described by the action (2.16) 
without the Einstein-Hilbert term of general relativity. ^^ 

4.2.2 Asymptotic analysis of the field equations 

The vacuum field equations derived from the general quadratic action 



S= I d^x 



71^1 + 72^2 + 73-2a 



(4.5) 



where 7i for i = 1,2,3 denote coupling constants, are obtained by gathering the 
Euler-Lagrange derivatives (2.20) together, viz. 

71 hV'V^R - 2g''^UR + -g^^R^ - 2RR''^\ + 72 (- g"-^ R^dR^"^ + V^V'^R 

- ADR"'' + 2V"V^ii - 4R'"'"''^Rcd + A.R'^'R^] = 0. (4.6) 

We are interested in the behaviour of the spatially homogeneous Bianchi cos- 
mological model of type IX, which is described by a metric of the form 

ds'^ = -dt^ + hij{t)uj'LO^ , (4.7) 

where w* for i= 1,2,3 are the S0(3)-invariant differential forms that characterise 
Bianchi type IX (given explicitly in Table 4.3). The induced three-metric is as- 
sumed to be diagonal and of the form 

h,,=diag[a''it),b\t),c\t)], (4.8) 



^^ Discarding that term is consistent with our purpose since we consider asymptotic solutions 
to the field equations hereafter. 



136 Chapter 4. Higher-order spatially homogeneous cosmologies 

where a,b,c are the scale factors. Substituting the Bianchi-type IX metric (4.7) 
together with the exphcit form (4.8) into the vacuum field equations (4.6) we obtain 
the mixmaster field equations in terms of the scale factors a, b, and c. We write 
only the mixed ("g)- and (^]^)-components (the (^2)" ^^d (^3)-components can be 
obtained from the (^^)-component by cyclic permutations). The result is 

[7i£i + 72^2 + 73^3] °o = 0' (4-9a) 

[7i£i + 72^2 + 73^3] \ = 0, (4.9b) 

where £"^ for i = 1,2,3 are the Euler-Lagrange derivatives associated with £i for 
i = 1,2,3 respectively, which are written down extensively in the Appendix at the 
end of this section on page 142 ff. In equations (4.9) the contributions from R^, 
R°'^Rah, and R"'^'^'^ Rahcd can be easily identified since they are multiplied by 71, 72, 
and 73 respectively. 

We now proceed to examine the evolution of the Bianchi IX model on approach 
to the singularity (occurring at t = 0); this is dictated by the system (4.9) sup- 
plemented by the remaining components of the field equations. The basic idea of 
the asymptotic method is based on the search for existence (or nonexistence) of 
chaotic behaviour in the Bianchi-type IX evolution according to the field equations 
(4.6), which is intimately connected to the nonexistence (or existence) of power-law 
asymptotic solutions of the system on approach to the singularity as t ^ 0. This 
method was first applied successfully by Barrow and Cotsakis [BC89, Cot90] who 
showed that, in the nonlinear theory based on a Lagrangian L = f[R) that is a 
polynomial function of the scalar curvature, the vacuum Bianchi-type IX model 
is nonchaotic and possesses monotonic, power-law asymptotes on approach to the 
singularity. Accordingly, we look for power-law asymptotes that, to lower order, 
have the form 

{a,b,c) = {tP\tP\tP^) (4.10) 

as t ^ 0. For simplicity we define constants q^ by 

3 



E^^ (4.11) 



Substituting the ansatz (4.10) into the Bianchi-type IX field equations (4.9a) and 
(4.9b) (similar terms appear in the (^2)" ^^d (^3)-components) we obtain, for the 
( o)-component, 

(£i)°o = ^t^'k' + («')' - 29'] [^1' - il'f + %'] + [***], (4.12a) 
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+ 2q^q^ - 3q^ - 3{q^f] + [•••], (4.12b) 

(£3)°o = *'l3/ + 3{q^f - Aq\^ + Aq\^ - Qq^] + [***], (4.12c) 

and, for the (^]^)-component, 

(£i)\ = \t-^[Ap,{q' -3)-q'- {q'f + lOg^ - 24] 

x[q^ + {q^f-2q^]+[-ki.i.], (4.13a) 

(£2)\ = ^t-l4pi(gi - 3)(g2 - 1) - (g^)^ - q\q^f + 2{q'f 

+ 6g2gi - 7g2 - 11(^^)2 + Uq^] + [***], (4.13b) 

(£3)\ = lt-^[8piiq' - 3)ipl-piq' + ^l + ^2 _ 2) 

- 2g^ - 2(^2)2 ^ g^3 _ 4^2j ^ [^^^j^ (4 -^3^^ 

where the explicitly written terms correspond to Bianchi type I and [***] are ad- 
ditional terms generated by the Bianchi-type IX potential. We know that the only 
power-law solutions to the Bianchi-type I field equations derived from equations 
(4.6) in a four-dimensional space-time are [Der89, CDGM91]: 

1. the well-known Kasner solution 

3 
ds^ = -dt^ + Y^ef'^{dx'f, with q^=q'^ = 1, (4.14) 

1=1 

where the p^s can be represented in the parametric form 



-s , , s(l -l-s) , . 1 + s 

1 + s + s^ l-|-s-|-s^ 1 + s + s^ 



Pl(^)= 1 , „ , .2 ' P2(S) = -— -— -2, ^3(S) = T— -— -2, (4.15) 



where the Kasner parameter s varies in the range s ^ 1; 

2. the isotropic solution 

3 
'y:2P'('d.TM2. with Til = m = mq = . 



3 1 

ds'^ = -dt'^ + "^t^P^{dx'f, withpi=p2=P3 = -. (4.16) 



1=1 



This plays a key role in determining the evolution of the Bianchi-type IX model 
near the singularity in the fourth-order theory of gravity described by the action 
(4.5): Looking for power asymptotes to the Bianchi type IX field equations amounts 
to looking for Kasner or isotropic asymptotes to these equations. 



13^ 
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We consider the algebraic system {(4.12b), (4.12c), (4.13b), (4.13c)}. Firstly, 
we seek for Kasner asymptotic solutions of the form (4.10), (4.11), and (4.14). Most 
of the additional terms appearing in the equations are "nondangerous" in the sense 
that they grow slower than the t~^ contributions (present in the Bianchi-type I 
field equations): they are unimportant as we approach the singularity. However, 
the following terms, which appear in equations (4.13b) and (4.13c), namely 



15a'' 



2a^ 
6V 



b c 
b^-c 



2aa 



+ 



+ 



62c2 
b 



^2g3 ^3g2 



b\2 

b) + 

2a^bc 



+ 



2ad 



^3g3 ^2^2 ^2g2 ' 



(4.17a) 



and 



BBa" 



7a^ 



b c 
b^-c 



+ 



\2aa 



+ 



62c2 
b 



&\2 
b) + 



^2^3 ^3g2 



a?'bc 6d2 



63 c3 



+ 



62 c^ 



+ 



12aa 
6V 



(4.17b) 



respectively, are of the form t^^+^Pi (the very first terms in expressions (4.17a) 
and (4.17b)) and t-4+4pi ^-^.j^g remaining nine terms in expressions (4.17a) and 
(4.17b)). Since pi < in the anisotropic case all these terms grow faster than t~^ 
as t ^ 0: They will be the dominant ones in the field equations on approach to 
the singularity. Thus in this case "dangerous terms" appear in the Ricci-squared 
and Riemann-squared Euler-Lagrange expressions. This is completely analogous 
to what happens in general relativity as demonstrated by the bkl approximation 
method: At a certain stage the description of the asymptotic dynamics in terms 
of the Bianchi-type I Kasner solution breaks down and one must take into account 
the aforementioned "dangerous terms" in the field equations; the dynamics is then 
described by the Taub solution of the Bianchi-type II model and this corresponds 
to a transition from one "Kasner epoch" to another. Hence, in our case, one should 
retrieve the same kind of oscillatory behaviour as that found in general relativity 
by BKL on approach to the singularity as t ^ 0. One can indeed explicitly check 
that the bkl solution [BKL70] 

2 -2piA 



cosh(2piAr) ' 
62 = 6(0)2 exp[2A(pi + P2)t] cosh(2piAr), 
c^ = c(0)2exp[2A(pi +pz)t\ cosh(2piAr), 



(4.18) 



where dt = a6cdr and A is a constant, satisfies the field equations (4.9a) and 
(4.9b) for Bianchi type IX to leading order on approach to the singularity, wherein 
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only the "dangerous" and the "Kasnerian" terms — those containing four dots — have 
been retained. 

On the other hand, if we introduce the asymptotic isotropic solution (4.16) in 
the algebraic system {(4.12b), (4.12c), (4.13b), (4.13c)}, it appears that all the 
terms in the equations are nondangerous. This proves the possibility of reaching 
the cosmological singularity in a monotonic, nonchaotic way. 

The conclusion from this analysis is that the fourth-order Bianchi-type IX equa- 
tions admit no anisotropic monotonic power-law solutions all the way to the sin- 
gularity, but only an isotropic one, given by (4.16). 

Now it is known that each vacuum solution of general relativity also satisfies 
the equations (4.6) in space-time dimension n ^ 4 since the i?-squared equation 
(cf. (2.20a)) and the Ricci-squared equation (cf. (2.20b)) are obviously satisfied 
if Rab = R = and because the variation of the quadratic Lovelock Lagrangian 
identically vanishes in space-time dimension n < 5. However, the solution space of 
higher-order gravity is larger than in general relativity; in particular, the Kasner 
solution (4.14) is not the general solution of the Bianchi-type I model in the fourth- 
order theory based on the action (4.5). Therefore, it is natural to address the 
question of the stability of the exact polynomial solutions (4.14) and (4.16). This 
is achieved through a perturbation analysis. 

Consider for instance the pure Riemann-squared theory and choose the param- 
eterisation 

r = ln(t), a = ln(a), /3 = ln(5), 7 = ln(c). (4.19) 

The linearised field equations for small perturbations ei, £2, and £3 of a = piT, 
P = P2T, and 7 = P3T respectively are 

Xi = X2, 

X2 = X3, 
9 
X3 = '^fi{Vl,P2,P-i)Xi, 

i=l 

X4 ^ X5, 

X5 = X6, (4.20) 



3 9 

\\ „' „,. JXi 

i=l i=A 



XQ = ^fi+6{P2,P3,Pl)Xi + ^fi-3{p2,P3,Pl) 



X7 = Xg, 
XS = Xg, 
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6 9 

Xg = '^fi+3{P3,Pl,P2)Xi + '^fi-6{p3,Pl,P2)xi, 
i=l 1=7 

where we have defined 

xi := e'l, X4^ := e^, xj := e's, 

and 

/i (pi ,P2,P3) = {2pl - 2pip2 - 2pip3 + 3pi +pI+P2+pI+P3-2) 

X {Pl +P2+P3-3'), 
f2{Pl,P2,P3) = -Pl(4p2 + 4p3 - 9) -P2(2P3 " 7) + pj + Jps - 11, 
f3{Pl,P2,P3) = -(2pi + 2p2 + 2p3 - 6), 

h{pi,P2,P3) = [2p2{P3 - 2) + 3p2 +P3 + 2P3 " 5]pi -pI{p2 + 2^3 " 5) 

,3 , Q 2 



P2(pi + i)-2pi + 3p: 



2) 



h{Pl,P2,P3) =Pl{2p2 + l) - PJ - pI + P2, 

f%{Pl,P2,P3) = 0, 

h{Pl,P2,P3) =Pl[2p2{P3 + 1) +P2 + 3P3 -4P3 " S] - P? (2p2 + P3 " 5) 

-P2P3- 2pi + 3pi-p3, 
h{Pl,P2,P3) =Pl(2p3 + l) -Pl -P3+P3, 
h{Pl,P2,P3) = 0. 

The characteristic polynomial related to the differential system (4.20) takes the 
forms 

_(A + i)(A-2)'(A-3)A2 

for the Kasner solution (4.14) and 

-1(2A-1)^(2A-3)^(2A-5)(A + 1)(A-1)^ 

for the isotropic solution (4.16). Hence the corresponding evolution laws for the 
perturbations are 

ej(T) ~ Cjo + Cji exp(-T) + o^T + as exp{2T) + Cj4 exp(3r), (4.21a) 

^i{T) ^ c'iQ + c-i exp(-T) + c-2 exp(r/2) + 4 exp(r) 

+ 4 exp(3r/2) + 4 exp(5r/2), (4.21b) 
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for i = 1,2,3, with the conditions 



Cll = kpi, C21 = kp2, C31 = kp3, 

P1C12 +P2C22 +P3C32 = 0, 

Ci4 = C24 = C34 = 0, 

Cll — C21 — C31 

^12 + C22 + C32 — U, 
Cl3 + C23 + C33 — U, 



k 
2' 



'15 



/ / p, 

^25 — C35 — U, 



where A; is an arbitrary but — as it stems from the perturbation analysis — infinitesi- 
mal constant. Both perturbations given in equations (4.21) grow exponentially for 
T -^ -|-oo: The corresponding solutions are unstable. The same conclusion seems 
to apply in the neighbourhood of the initial singularity, i.e. as T ^ -co, due to 
the presence of the terms Cjiexp(— T), Ci2T and 4^exp(— T). However, the mere 
role of the terms involving exp(— T) is to shift the position of the initial singularity 
on the time axis. More explicitly, looking at the scale factor a and assuming that 
A;exp(— T) ^ 1 we obtain 



a{T) + ei (T) ~ qq + piT -|- cn exp(-r) -|- ci2r 



Qo + pi In 



exp{T){l + kexp{-T)) 



+ C12T 



Qo + Pi ln(t + k) + C12 ln(t) 



(4.22a) 



in the case of the Kasner metric and 



a(T)+ei(r) 



ao + -T + c'li exp(-r) 



ao + - In 



exp(T)(l-F/cexp(-r)) 



ao + - ln(t + k) 



(4.22b) 



for the isotropic solution. Similar relations hold for the scale factors P and 7. 

In conclusion, due to the presence of the divergent logarithmic term in (4.22a) 
the Kasner metric (4.14) is unstable on approach to the singularity, but the isotropic 
metric (4.16) is stable. To resume, the important facts of our analysis are: 

• The quadratic theory based on the action (4.5) admits no anisotropic polyno- 
mial solutions other than the Kasner solution (4.14). In general relativity this 
is the general-thus stable — solution to the Bianchi-type I model; this leads in 
the Bianchi-type IX case to a shift from a Kasner solution to another with a 
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different set of Kasner exponents, thereby giving rise to the characteristic os- 
cillatory behaviour of the mixmaster universe. Here we have shown that the 
Kasner polynomial solution is itself an unstable solution of the Bianchi-type I 
fourth-order field equations. This implies that an oscillatory behaviour based 
on Kasner asymptotes cannot be generic in the Bianchi-type IX fourth-order 
dynamics. The typical mixmaster oscillatory behaviour is of zero measure 
amongst all possible behaviours since it is unstable with respect to small 
perturbations.^^ 

• There exists one stable, isotropic and monotonic solution, given by the metric 
(4.16), which attracts sufficiently close trajectories in the phase space. Such 
a situation is not met in general relativity, for the isotropic metric (4.16) is 
not a solution of Einstein's vacuum field equations. 

Inclusion of matter fields does not alter the conclusions since they become 
dynamically negligible near the singularity: In general relativity matter fields are 
negligible with respect to the dominant metric terms, which grow as t~ as one 
approaches the singularity; in the fourth-order dynamics the metric terms grow 
typically as t~^ as t ^ 0; hence matter terms do not influence the Bianchi-type IX 
evolution. 

Remark. When the space-time dimension is greater than four, the Kasner metric is 
no more a Bianchi-type I solution to the fourth-order theory derived from the action 
(4.5) [CDGM91]. Therefore, it seems reasonable to conjecture that, in contrast to 
the general nondiagonal case of multidimensional spatially homogeneous cosmology 
in general relativity, it is impossible to build a mixmaster universe in the fourth- 
order Kaluza-Klein theory based on the bkl approximation scheme. 

Appendix 

We give here the mixed components of the Euler-Lagrange derivatives (2.20), spe- 
cialised to Bianchi type IX. 



lii^i'' 





7i(4"ada~^ + iaba^^b^^ + 4aca'^c^^ - Td^a''^ - 4ad^a~'^ - Adha^^h' 
+ Adb^a'^b"'^ + Sdhca'^b'^c^'^ - 4dca"^c~^ - 4d^6a"^6"^ 
+ 4:dc^a-^c-^ - Ad^ca-^c-^ + Aa^ha-%-^ - Qd%'^a-%-'^ 
- Ad%ca~'^b-^c-^ + 4d2 ca-^c-i - Gd^c^a-^c-^ - Ad^a'^ 



^^This is supported by the numerical analysis of the Bianchi-type IX field equations in fourth- 
order gravity [CDDQ93]. 
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+ 8d6ca^^6~^c~^ - 4d6^a"^6"^ - Aab^ca'^b^'^c'^ - 2hba'^b'^ 
+ 8abca''^b'^c~^ - 4abc^a~'^b~^c~'^ - dbab'^c'"^ - aba'^bc"'^ 
+ 2d6a"^6~^c~^ - aba~^bc~'^ - 2aba'%~^ - Aae'a'^c'^ 
+ 'iaba~^b~^c^ + Aaca~^c^^ - acab^'^c^^ - aca'^b^'^c 

- 2aca'^c'^ + 2aca'^b''^c''^ + 3aca~^b'^c'^ - 2aca'^c'^ - Abb^b'^ 
+ A'bbb-^ + Abdb-'^c~^ - 2pb-^ + Abc^b'^c-^ - Abcb'^c'^ 

- Ab^db'^c"^ + Ab'^cb-^c'^ + 2b^a^b-^c-^ - 6b'^c^b'^c~^ 

+ 2b'^a-^b-^c'^ - 2i?a-'^c-^ - AbcH-^c-^ - Ab'^b'^ + 46"c fo-^c-^ 
+ 3bca'^b~^c~^ - bca-%c~^ - bda~^b~^c + 26ca~^6"^c"^ 

- bca~%~^c - 2bcb~^c~^ - 2bcb~^c~^ + 4c cc"^ - 2c^c~'^ - Acc^c'^ 

, r,-2 2i,-2 -4 , r,-2 -2i2 -4 r,-2 -2i,-2 ^ -2 -4 , 1 4i,-4 -4 

+ 2c a b c + 2c a be — 2c a b — Ac c + ^a b *c 
_ laH-\-^ - \a%~^c-^ - \a~%''c-^ + \a-%-^ - ^a'^-'c' 
+ 1^-2^-2 + ia-Vc-4 - ^a-%^c-^ - \a'%~^c^ + ia'^ft-V 

72(£2)°o = 

2'aaa~ + a'6a~ 6~ + 'aca" c~ — a a~ — 2aa a~ + aaba^ b~ 

+ ddca~^c"^ - d6a"^6"-^ + aiPa''^b~'^ + 2d6ca"^6"^c"^ 

- dca^-'^c"^ + dc^a^^c^'^ - d^ba^^b^^ + d%a^%^^ - d^ca^^c^^ 

0-212 -2r-2 •2;- -27-1 -1 0-2-2 -2 -2 , •2" -2 -1 

- 3a a — a oca o c — 6a c a c + a ca c 

+ d^a-^ft^^c-^ + d^a-^^-^^c^^ + d%-'^c-'^ + d'ba-^b-^ - 2d^a'^ 
+ dbba~^b~^ - db^a'^b"^ - di?ca~^b~'^c'^ + 2d6m~^6"^c"^ 
+ 2d6ca~^6"^c"^ - 2dbab~^c~'^ - 2dba'Hc''^ - db(?a'^b'^c''^ 
+ 2dba~^b~^c^ + dcca"-^c~^ - dc^a"-^c"^ + d'c a~-^c"^ 

- 2dca6"^c"^ - 2dca"^6"2c + 2'bbb~^ + 2dca~^62c~^ 
+ 'bcb'^c-^ - 2bbH-^ + bbcb'^c'^ - b^b-^ - bcb-^cr^ 

- b^db'^c-^ + b'^cb~^c'^ - Sb'^c^b'^c'^ + bc^b'^c-^ + b'^a~^b-*c^ 
+ 6^025-4^-2 ^ 52^-2^-2 _ 2525-4 _ 5c36~ic-3 + bddb'^c-'^ 

+ b'cb~'^c~^ + 2bca%~^c~^ + 2 ccc"^ - 2bca'Hc'^ - 2bca'H'^c 
_ c2c-2 + c^a^ft-^c-^ + c^a-%^c-^ + c^a'^ft-^ _ Tcc^c'^ 

- 2i?c-^ + fa^^-^c-^ - \c?b-''c-^ - \a%-^c-'' + la-25-2 

- \a-H^c-^ - \a-%-^c^ + \a'^c~^ + fa'^^^c-^ - \ar%-^c^ 
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73 [^-i 





73(4ada~^ - 2a^a"^ - 4ad^a~^ + 4dd6a~^6~^ + 4:ddca~'^c~^ 

- 6dH^a-^b-^ - 6h^c^a-^c-^ + la^a-^'^c'^ + 2d2a-^6-2g2 
+ 6d^b-^c-^ - Ad^a-' + 4d66a-^6-2 _ 'jabab-^c-^ 

+ 2dba-^b-^ - rdba'^bc-'^ - Idba'^b"^ c-"^ + Idba'^b'^ 

+ 4dcca"^c~^ - 7dca6~^c~^ - 2dca"^6"^c"^ + 5d6a"^6"^c^ 

+ 2dca"^c"^ + 5dca~^6^c"^ - Jdca'^b^'^c + 2dca"^c"^ + Abbb'"^ 

- 26^6-2 _ 4552^-3 + 2&2a26-^c-2 + mdb~^c'^ - Qb^c%-^c-'^ 

+ 262a"25-4^2 ^ 652^-2^-2 ^ 4ftcc6-ic-2 + 56ca26-3c-3 _ 4^,2^-4 

-2f,„-3 _ 7/,A^-2f,-3^ I oj,AA-l^-3 _ o/,A^-2/,-l^-l 



Ibca-^bc-^ - Ibta-^b-^c + 2bcb~^c-^ - 26m"^6 



c 



+ 2bcb-^c-^ - tSc'"^ - Acc^c'^ + Ic^aH^'^c"' + 4c cc'^ 

_ 3«2^-2^-4 _ 3^-2^2^-4 + 3^-25-2 _ 3^-2^-4^2 ^ n^-4^4^-4 
+ 3^-2^-2 _ 3„-4^2^-2 _ 3^-4^-2^2 ^ 1^-4 ^ 3^-2^-2 

- 7i(4'd'a~^ - 8ada^2 + 8'dba^^b^^ + S'd'm^^c"^ + Sdd^a^^ - Gd^a^^ 

- 20dd6a^2&~i - 20ddca"2c"^ - 4d62a"^6~2 + 8d6a"^6"^ 
+ Sdbca^^b^^c^^ + Sdca^^c"^ - Adab^'^c^'^ - Adc^a^^c^"^ 

+ 4da~^62c~2 + Ada~^b~^c^ - 8da~^ + Sd^ca'^c"^ + Sd^ba^^b'^ 

- Sa^ba-^b"^ + 2d^b^a'^b''^ - 8d^ca~'^c'^ - Ud'^bca'^b'^c'^ 
+ 2d^c^a~^c''^ - 6d^a~%'^c'^ + 4d 6 a'^^-^ - Qd'^a'%'^c^ 

+ 12d2a-'^ - 2a^b~'^c-^ + 4d6ca"^6"^c~^ + 4dfe^a"^6"^ - 80660*^6^^ 

- Adb'^ca'^b-'^c-^ - Adbc^a'^b-^c"^ + Adbab-^c''^ + Adbca~^b'^ c'^ 
+ Udba'^bc'"^ - Adba~^b~^c^ + 4d "ca^^c"^ - Sdba'^b'^ 

- 8dcca'^c~'^ + 4dca6~2c~^ - Adca~^b'^c~^ + 4dc^a~^c~^ 

+ 12dca"^6~2c + 4'5'6~^ - Abbb'"^ + 8'6c6"^c~^ - 8dca~^c~^ 

- 262^-2 _ sbbcb-^c"^ + 12bcb-^c-^ - AbcH-^c-"^ + AbbH'^ 

- ba~Hc'^ - 2ba-%-^ + ^ba-%-\^ + 2bb~^c''^ - ba%~'c-' 

+ Ab^cb-^C-' - Ab^cb-^C' + 262c26-2^-2 _ 4^2^-2^-4^2 _ 355-3 
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- 462a-2c-2 - Ab^a%-^c-^ + 8b%-^ + Sb-fb-^c-^ + Ab&'b-^c-^ 

- Jbca^b'^c'^ - 8b'ccb-^c~^ + ^ca^Hc'^ + dbca^'^b-^c 

- 2bcb~^c~^ - 2bcb~^c-^ - 4'ccc~^ - 2c^c"^ - 2bca~^b~^c-^ 

+ 4cc^c"^ - ca^b~^c~^ - ca'H^'^c - 2'ca~'^c~^ + 2cb''^c''^ + 4'c'c~^ 

+ 3ca-262c-3 _ 2cc-3 - 40^0-25-2 ^ g^2^-4 _ 4c2^-2^2^-4 

+ la-26-4c2 _ la-2c-2 _ 3^-454^-4 ^ 3^-4^2^-2 _ 1^-2^-2 
_ 3«-45-4c4 _ 9^-4 ^ 3^-4^-2^2 ^ 1^-2^-2 ^ 3^-4 ^ 3^-4). 



-3 



72(£2)\ = 

- 72(2'a'a~^ - 4a aa"^ + 4'a6a~-^6~-^ + 4'aca~^c~^ - 3a^a~^ + 4ad^a" 

- Tdaba^ b" — 7daca~ c~ + 3dba^ b^ — 2db a" b~ 

+ 4a6ca"^6"^c"^ + 3am"^c"^ - 2ac^a"^c"^ + 2aa6"^c"^ 

+ 2da'^b'^c'^ + 2da~^b~'^c^ - Ada~^ + 2d%a'%'^ + 2d^ ca~^c"^ 

- 2d%a-'^b-^ + d^b^a-^b-^ - 'id^bca-'^b'^c'^ - 2d^ca~'^c'^ 

+ d^i?a~'^c'^ - 3d^a-%^c-^ - Sd^a'^-^c^ + Gd^a'^ + d%~^c'^ 
+ d'?) a"^6"^ - 4:dbba~^b~'^ + dbca'^b'^c'^ + 2d6^a~^6~^ 

- 2db'^ca^^b'^c-^ + d6ca~^6"^c"^ - 2d6c^a"^fe"^c~^ - 2dbab~^c~'^ 
+ 6dba~^bc~'^ - Adba^^b^'^ - 2dba^^b^^c^ + d'c a~^c~^ 

- 4dcca~-^c~^ + 2dc^a"^c^^ - 2dca6"^c~^ - 2dca~^6^c~^ 
+ 6dca~^6"^c - 4dca~^c"^ + "bb~^ - bbb~'^ + 26' c6~^c~^ 

+ UH-^ - 2bbdb-^c-^ + 3bcb-^c-^ - bc%-^c-^ - 2ba%-^c-^ 

- 2ba~Hc''^ + 2ba~%-^c^ + Ir'db-^c^^ - }?db''^c^^ + b'^d'^b~'^c~^ 
+ 62a26-4c-2 _ 362a-26-4c2 _ 3b^a-^c~^ + 2625-^ + 26c ft-ic-^ 

- 2bccb-^c~'^ + bc^b-^c'^ - 2bca%-'''c-^ + Qbca~Hc~^ 

+ Qbca~%~^c + "c'c~^ - 'ccc~'^ + cc^c~^ + 200^26^0"^ - 2ca26~2c~^ 

o" -21-2 I -2 21-2 -4 Q -2 -2;,2 -4 , o-2 -4 o -2 -2i-2 

- 2ca b c + c a b c — 3c a be +2cc — 3c a 

+ }^a%-^c-^ - laH-^c'* - la^b-^c-^ - ^a'^'^ + ^a-'b'c~^ 

+ la-26-4^2 _ 1^-2^-2 _ 9^-4^4^-4 ^ 3^-4^-2^2 ^ 3^-4^2^-2 
_ |a-46-4^4 _ 3^-4 ^ 1^-2^-2 ^ 1^-4 ^ 1^-4). 



73 (£3)', 



73(4'd"a ^ - 8"d"da ^ + 8060 ^b -"^ + 8'dca ^c ^ - Gd^a ^ + 8dd2a 
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- Sdaba^'^b^^ - Sadca^^c"^ + Adba^^b^'^ - AaiP'a^^b^'^ 

+ 8d5ca"^6"^c"^ + Adca^'^c'^ - 4dc^a~^c"^ + 12da6"^c"^ 

+ 4da~^b'^c~^ + Ada'^b'^c^ - 8da"^ + 2d^b^a''^b-^ + 2d^c^a'^c~'^ 

- 6a^a-%'^c-'^ - 6d^a-%-'^c^ + Vld^a~^ + <od%-'^c~'^ - %dbba'H~'^ 
+ Adb^a'^b'^ - 4db^ca-^b-^c-^ - 4dbc^a~^b-^c-^ - Udbab'^c-'^ 
+ I2aba'%c~'^ - 8dba~^b~^ - Adba~^b~^c^ - 8dcca~^c~^ 

+ 4dc^a"^c"^ - I2dcab''^c~^ - Adca'^b'^c'^ + Udca'^b^'^c 

- 8dca-^c-^ + 2Pb-^ - IbaH-^c-'^ - 7ba~^bc-^ + 2ba'^b'^ 

+ 5ba-'^b-^c^ - 2bb-^c-^ + 266"^ + 2i?i?b-'^c-'^ + S}?a%~^c-'^ 

- Sh^a-'^b-'^c^ - Si?a-'^c~'^ - bca^b-~^c~^ + Ibbda-^bc'^ 

+ 26ca"^6"^c~^ + 15bca~^b~^c+2bcb~^c-^ + 26c6"^c"^ + 2c^c~^ 

- 7ca^6~^c~^ + 5ca~%'^c-^ - 7da''^b''^c+ 2ca~'^c-^ - Tcb-'^c'^ 
+ 2CC-3 + Sc\''b~^c~* - Sc^a'^^c-' - 8c^a-%~^ + fa%-^c-^ 
_ ^b-^c'^ - yb-^c-^ + |a-262c-4 - |a-26-2 + fa^^^-^c^ 



4" 



+ |6-V2 + i6-4 + ic-4). 



4.3 Quadratic Hamiltonian cosmologies 

4.3.1 Canonical formalism and Misner's parameterisation 

We henceforth assume that the Bianchi-type metrics be diagonal so as to simphfy 
the investigation.^^ According to the possible forms of the Bianchi metrics, we start 
from the spatial element d/^ = hij{t)uj'^uj^ for which several parameterisations are 
equally acceptable; one usually adopts either hij = diag[a^(t), 6^(t), c^(t)] or hij = 
diag[e^"(*'', e"^^^^' , e^'''^*-'] as possible ansatz, with proper time or logarithmic time. It 
is useful to represent the anisotropic scale factors in terms of the logarithmic volume 
H and orthogonal anisotropic 'shears' (3±. These prescriptions, first introduced by 
Misner, are simply^^ 



a = II 



+ /3+ + V3/3_, (3 = fi + P+-V3P-, -i = 11-213+, (4.23) 



^^This assumption might be wrong for some Bianchi types; the question of whether the Bianchi 
metrics are diagonalisable or not in generalised theories of gravity remains still unanswered. 
"See, e.g., [Mis94]. 
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and imply that the matrix formed with the (3's be traceless. Adopting Misner's 
parameterisation we may write the three-metrics of Bianchi models in the form 



d/2 = e^>^ 



^2(/3++V3,3_)(^l)2^^2(;3+-V3/3_)(^2)2^^_4/3+(^3)2l^ ^^^4) 



where n, /3+, and /3_ are functions of time only and a;* for i = 1,2,3 are the 
invariant differential forms that characterise the Bianchi type under study. 

We rely on the Ostrogradsky canonical formalism of quadratic gravity theo- 
ries developed in Subsection 3.3.4, which is completely generic, that is, applicable 
to space-times without isometrics. Henceforth, we intend to specify this general 
Hamiltonian formalism to spatially homogeneous cosmologies. To this aim we 
perform an appropriate canonical transformation that renders Misner's parame- 
terisation manifest: It maps the original set of Ostrogradsky canonical variables 
{hij, Kij-jp"^^ ,V'^^} onto the set^^ {^, P±, Kq, K±;Il^,Il±,Vo,'P±} and is explicitly 
defined according to the following prescriptions: 

and 

\/3 ^3 3 

where the new momenta W^ are given by 

n\ = ^(2n^ + n+ + V3n_), 
n22 = ^(2n^ + n+-V3n_), 



and the new -fC's and P's by 



Kij = Kjj + -^ hij , K^) = -^ diag {K+ + VSK- , K+ - VSK^ ,-2K+), 



pij = pTij ^ T^f^ij -pTi ^ 1 diag(P+ + VSV-,V+ - V3V-,-2V+). 

In terms of these new variables the original action in Hamiltonian form (3.179) 
becomes 

s = Idt [n^/i + n±/3± + VoKo + v±k± -n:k- m^K^ , (4.25) 



^^The symbol S± stands for {S+, S-} and the notation A±B± must be understood as A+B+ + 
A-B^. 
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where the spatial integration Jlo^ f\ uP f\ uj"^ has been performed. The super- 
Hamiltonian constraint is, in general, a quite complicated expression that is ob- 
tained by specifying the canonical Hamiltonian density (3.180) in terms of Bianchi 
isometrics. For instance, the Bianchi-type IX super-Hamiltonian in the pure qua- 
dratic gravity theory is given explicitly by the expression^^ 



CKix = ae^C • C - P • R 

+ ^V+{Kl - Kl) + ^K.{K^V^ - n,) 

D 6 D 



(4.26) 



where we have denoted 



P . C — p^M/^ . ., r<nij 
y^ y^ . — e '^nnk'^ 



and 



mj'fe'- 



;K 



2e^'^+ (1 - cosh(4\/3/3_)) - 3e' 



8/3+ 



+ Kl 



4e"2/3+ cosh(2\/3/3_) - e^^+ (l + 7cosh(4\/3/3_)) - -e^ 



8/3+ 



aVsK+K. fe"^^+ sinh(2V3/3_) + e'^'^+ sinh(4\/3/3_ 



(4.27) 



p.R:=pTii(3)^^ 



'«j 



-2^ 



Lf''^|^-^^-|^-#''»-(''-^-) 



(4.28) 



and where the function V(/3+,/3_) is given explicitly in Table 4.4 at the type IX 
entry. 

As regards the super-momentum constraint, we reiterate that one may consis- 
tently choose the shift vector to be zero in all cases but type VIq (cf. the remark 
on page 132). To simplify the analysis we shall consider those canonical systems 
that correspond respectively to the pure -R-squared and Weyl-squared variants of 
the general quadratic theory. 



^^The ensuing canonical equations are far more simpler to write down than the intricate Euler- 
Lagrange expressions given in the appendix of Section 4.2. 
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Table 4.4: Potentials for types I, II, VIIq, VIII, and IX. 



Type 


V(/3+,/3-) 


I 
II 




exp(-8/3+) 


VIIo 


2 exp (4/3+) [cosh (4^3/3-) - 1] 


VIII 


exp(-8/3+) + 2exp(4/3+)[cosh(4^/3/3_) - 1] 




+4exp(-2/?+) cosh(2V3/3_) 


IX 


exp(-8/3+) + 2exp(4/3+)[cosh(4V3/3_) - 1] 




-4exp(-2/3+) cosh(2V3/3-) 



4.3.2 Pure R^ Bianchi cosmologies 

Reaucea Hamiltonians ana exact solutions 

We first study the pure i?-squared Bianchi cosmologies, the field equations of 
which were analysed by several authors with the aim of examining the asymp- 
totic behaviour of the mixmaster model in that specific quadratic variant; see, 
e.g., [BS89, BC89, SZ93, Spi94]. In contradistinction to these works, we lay our 
analysis on the canonical system rather than the field equations. Our goal is to ap- 
praise the achievements of the Hamiltonian formalism; in particular, to determine 
whether it simplifies the investigation, and to compare our results with those that 
are published. 

The canonical action (4.25) reduces to 



S 



dt 



n^/i + n±/3± + VoKo - n:k 



(4.29) 



due to the existence of the primary constraint (3.139b), which translates into V± ~ 
in terms of the new variables introduced in the preceding subsection. The super- 
Hamiltonian is 









Kolia - ^K±Il± 



(4.30) 



where V(/3+,/3_) denotes the potential of the Bianchi type considered (cf. Table 
4.4), which stems from the three-dimensional scalar curvature, and the coupling 
constant (3 of the -R-squared term in the action (3.169) has been set equal to one. 

We must apply the Dirac-Bergmann consistency algorithm to this constrained 
system, the Poisson bracket being of course defined with respect to the new vari- 
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ables; or, equivalently, we can translate the results of Subsection 3.3.4 in terms 
of those variables. The Dirac Hamiltonian density is i^D = NJi + X±T'±, with 
Lagrange multipliers \± . Conservation of the primary constraints V± ~ brings 
forth secondary constraints X± ~ 0, viz. 



P± wO 



X± = VoK± 



V2 



n±«o, 



the time evolution of which determines the Lagrange multipliers \±. Since we have 
the nonzero Poisson bracket {T'±,X±} ~ ~'Po the above primary and secondary 
constraints are second class; hence the associated degrees of freedom, i.e. T'± and 
K±, are unphysical and can be removed. The super-Hamiltonian (4.30) becomes 
simply 



3 










and the ensuing canonical equations are 



/^ 



4nk.. 



'^ 6 Po 



Ko = N 



V3/ 2 , n^+n? 



+ ^^oe-3'^-^e-2A'V(/3+,/3- 



n, 



N 



2 



^/3 

Vo = -\N{AKoVo-Ii^?,. 
o 

This canonical system possesses the first integral 



(4.31) 

(4.32a) 
(4.32b) 

(4.32c) 

(4.32d) 
(4.32e) 

(4.32f) 



n„ 



^ O I O '•'O 5 



(4.33) 



where ko is an arbitrary constant. 

In the usual minisuperspace approaches of Hamiltonian cosmology one goes 
further by firstly fixing the lapse function, then solving the super-Hamiltonian for 
an adequately chosen canonical variable. The outcome is a nonvanishing 'reduced' 
Hamiltonian density. We proceed likewise and firstly consider space-times of non- 
constant scalar curvature. 
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Nonconstant scalar curvature. First of all, we fix the time gauge by setting 

R = pe\ (4.34) 

where p = ±1 (this allows for a separate discussion on space-times with positive 
or negative scalar curvature). According to the definition (3.174) the variable Vo 
becomes 



v3 3^+t 



Vo = -—pe 



(4.35) 



Differentiating this expression and making use of the canonical equations (4.32) and 
the first integral (4.33) we obtain the form of the lapse function that corresponds 
to the gauge choice (4.34), namely 






3P „3/i+t 



(4.36) 



Solving the super-Hamiltonian (4.31) for Ko and performing the canonical trans- 
formation 



^ 



-^{^-t) 



n„ 



2n„ + -iKoVo 



we obtain, after little algebraic manipulation, the reduced form of the Bianchi 
i?-squared canonical action (4.29), namely 



S= fdt[Uf,fi + U±l3±-H 



where the reduced Hamiltonian density is 



H 



4fco 



' ^' ^' ^e2^fpe^ + V(/3+,/3_))+fc2 



4n^ - n^ - n 



The canonical equations are readily derived from the action (4.37), viz. 



A = — n/.' 






2 

/Co 

1 

ZrCo 

3p2 



'' IGfco 

n± = 



n±, 

[3/>e^ + 2V(/3+,/3_)]e2^ 



3^1 o^^ 



lefco dp± 



(4.37) 



(4.38) 



(4.39a) 
(4.39b) 
(4.39c) 

(4.39d) 
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They are equivalent to the autonomous system of second-order coupled ordinary 
differential equations 

By mere inspection, we see that the system (4.40) decouples when the potential 
term vanishes, i.e. for type I; hence we carry on to analysing the corresponding 
type I system. 

From the canonical equation (4.39d) with zero potential, we conclude that the 
momenta II-i- are constants of motion. Hence we can readily integrate the equation 
(4.39b): the variables f3± are simply linear functions of t, namely 

/3±(t) = -^t + 6±, (4.41) 

where the constants of integration b± can be removed by a change of scale of the 
spatial coordinates. Furthermore, the equations (4.39a) and (4.39c) reduce to 



2 



A = — n,„ (4.42a) 



K = ^e*. (4.42b) 



This is easy to solve and we obtain 



.3.W . 6-^feg-o "°^^"" (4.43a) 



fcoWo 1 + rioC^'^"* 
3 1 — noC^ 



n^(t) = o -, ^ ^2.^v (4-43b) 



where Uo is a new constant of integration and (^^ := |[l-|-(n^-|-n^)/A;o]. Therefore, 
the general Bianchi-type I metric is given by 

ds2 = _|^e3^We-Mi2 + e^We-* ^ e^'^^^d.x'f , (4.44) 

where the constants Vi for z = 1, 2, 3 satisfy 

Now, it is interesting to compare the form (4.44) of the general solution with 
those representations that were obtained by Buchdahl [Buc78] and independently 
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by Spindel and Zinque [SZ93] from a direct analysis of the Bianchi-type I higher- 
order field equations. In the former article the time coordinate is defined as t[g] = e* 
and the logarithmic volume as //rgi = ^(// — t) (with these relations one easily 
passes from the expression (4.44) to Buchdahl's solution); in the latter the time 
coordinate is given by trgz] = t whereas the variable Arg^i coincides with 3//. Upon 
a mere redefinition of the constants of integration in the general solution (4.43a) 
we recover Spindel and Zinque's solution for Arg2i(t), namely 






gA[,,](t) ^ g3M(t) ^ _q2^2 z (4.45) 

where Q^, A^, and a^ are positive constants defined in terms of p, ko, uJq, and rio 
by 

Q •=4'^o, ^ -=4^1' " •=^^' 

respectively. The proper time r is obtained, up to constant factors, by integrating 

dr _ el(«-i)* 

dt ~ ^l-pX^a^eQ^' ^^'^^^ 

We thus recover the two distinct behaviours of the general solution (4.45) found 
by Spindel and Zinque [SZ93] whose analysis of the Bianchi-type I field equations 
generalises Buchdahl's work [Buc78]. Strictly speaking, if p = +1, i.e. positive 
curvature, then the function in the right-hand side of equation (4.46) exhibits a 
vertical asymptote at e'^^°° = A^^a^^ corresponding to r ^ +00; this behaviour 
features an asymptotic de Sitter space-time. On the other hand, if p = —1, i.e. 
negative curvature, ^^ then integration of equation (4.46) for t G M yields a finite 
proper time after which the Bianchi-type I universe recollapses. 

Remark. The metric (4.44) becomes isotropic if and only if II-i- = 0, i.e. Wo = |; 
that is, up to rescaled coordinates, 

2t jr2 
d.^ = ! dt' + ^. (4.47) 

(l-noe3*) (l-noe3*)3 

Constant scalar curvature. When the scalar curvature is constant the field equa- 
tions of the i?-squared variant are satisfied by an arbitrary Einstein space, i.e. 
Rab = \Rgab = ^9ab with A Constant. The problem of determining the general 
form of the metrics corresponding to Einstein spaces was undertaken by Petrov 
who proved, in particular, the following result [Pet69, p. 84].^^ 

^'^This is the sole case considered by Buchdahl [Buc78]. 

^*We have modified the original statement in accordance with our conventions and corrected a 
misprint. 
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Theorem 4.3.1. The metric 

n— 1 
ds2 = _(dx")' + ^/,(x")(dx^)' 



j=l 



de£nes a n-dimensional Einstein space Rah = -Rgab = ^dab if the functions fi for 
i = 1, . . . ,n — 1 are chosen as follows: 

{ 2 2aj . 

sin "-1 fax") tan " (Iq-x") if A < 0, 
2 2i (4.48) 

sinh^^(/3x")tanh~(i/Jx") if A > 0, 

where a, j3, ai, and Pi are constants; and if A = one obtains either an 5"", i.e. a 
space of constant curvature, or the generalised Kasner metric^^ 



71—1 n— 1 



/,(x") = (x")'^\ witi E^^ = 1 = E^^ 



We demonstrate how the closed-form solution (4.48) can be derived from the 
canonical formalism, in the case n = 4 and for Bianchi type I.^" First of all, 
observe that a constant scalar curvature, R = 4A, entails the vanishing of the 
constant ko in the first integral (4.33) owing to the fact that the momentum Vo 
reduces to Vo = — 2-v/3Ae^^; hence we have 11^ = KqVo- We are free to fix a time 
gauge by choosing a specific lapse function. To recover Petrov's representation of 
Theorem 4.3.1 we could either start anew from the canonical equations (4.32) with 
zero potential and A^ = 1 or, equivalently, choose N = VsVo and perform the 
transformation from the coordinate time t to the proper time r on the result. In 
both cases we obtain the differential equation 

p; - c^2p^ = 0, (4.49) 

where a prime denotes differentiation with respect to r and lo'^ := 3A. According 
to the sign of A there are two distinct solutions, which involve either trigonometric 
or hyperbolic functions, as this is expected from Theorem 4.3.1. For instance, 
if A > 0, then equation (4.49) integrates and gives Voij) = po sinh(u;oT), where 
p1 = |(n^ + li^_)/io1. The anisotropic shears are obtained upon integrating 

6po 



^^If A = and n — 4, then the Kasner metric is the general solution. 

^°The simple procedure given here yields the same results as Buchdahl's modus 



[Buc78]. 
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that is, 



/?±(r) 



GuJoPo 



In 



tanhf — ^ 
2 



(4.51) 



Therefore, the general form of the metric can be written, up to rescaled constant 
factors, as 

3 2i^ 

ds2 = -dr2 + sinhi(L^oT)^[tanh(i^)] "° {dx')\ (4.52) 



where the parameters i^i satisfy the identities 

3 3 



Y.'y^ = 0, 



Y,.^ = \ul = 2K. 



1=1 



i=l 



We have thus recovered Petrov's representation (4.48) for the positive-curvature 



case 



21 



Remark. As this was first noticed by Buchdahl for a negative scalar curvature 
[Buc78], if one defines the new parameters r* := — tanh(2i^oT) and pi := fj/u;o+ 3 
for i = 1,2, 3, then one may take the limit of the spatial element in the metric (4.52) 
as A tends to zero: 



lim 



COsh3(c^„T) ^(T*)'^'(dxy = ^T2P»(dx^)', 



j=l 



1=1 



where the parameters pi satisfy the relations 
3 



Y.P' = 1' 



E^^ = i- 



i=l 



i=l 



This is precisely the Kasner metric. 

Isotropic moaels 

Consider the closed flrw cosmological model, which is contained in Bianchi type 
IX, the three-metric of which is 



d/2 = A^{t) 



dr 



1 - r^ 



+ r^dn^ 



(4.53) 



^^One may proceed likewise for A < 0: the hyperbolic functions in the solution (4.52) are then 
replaced by the corresponding trigonometric ones (cf. [Buc78]). 
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where dil^ = d-d^ + siii^ {'d)dip; r, -d, (p are the spherical coordinates; and A{t) is 
the scale factor. 

We take A and Ko := ^A^K as configuration variables; their respective con- 
jugate momenta are Ha '■= QAp^^ (for i = j) and Vo '■= ^A~'^V. For an arbitrary 
nonlinear Lagrangian the latter momentum becomes Vo '■= —2\'3Af'{R) according 
to the definition (3.138b). The canonical action is simply 



where the super-Hamiltonian is 



UaA + VoKo - NJ{ 



(4.54) 



J{ = V3Vo-^Ko^ + V{Vo), (4.55) 

with the potential V defined by the expression (3.143). 

For instance, if one considers the Einstein-Hilbert Lagrangian with an addi- 
tional i?-squared term (cf. the Lagrangian density (3.169) with A = = a), then 
the above potential reduces to V{Vo) = m^( 3 ^° + '*~^^)^- The quantum cos- 
mology based on such a variant of the generic quadratic theory was investigated by 
Hawking and Luttrell [HL84]. They showed that the wave function of the closed 
FLRW minisuperspace universe could be interpreted as corresponding in the classi- 
cal limit to a family of — particular — solutions that feature a period of inflation fol- 
lowed by a matter-dominated era. This era exhibits a rapidly oscillating scale factor 
superimposed on an overall expansion, after which the universe recollapses. A di- 
rect comparison with Hawking and Luttrell's canonical formalism shows that their 
canonical Hamiltonian coincides exactly with the super-Hamiltonian (4.55) (with 
the specific aforementioned potential) if the lapse function is taken as N{t) := A{t) 
and once the trivial canonical transformation {QfHLl ~^ 'Po', Hq, ^, -^ —Ko} from 
their variables to ours has been performed. 

We henceforth investigate the pure i?-squared variant of the generic nonUnear 
case. The super-Hamiltonian is given by 

^ = V3Vo-^Ko^ + IaV^ (4.56) 

and the ensuing canonical system, 

A = -^N^, (4.57a) 

ko = N{^AVo + V3), (4.57b) 

^^ = -^{1-^0 +^Ko^), (4.57c) 

Vo = ^N^, (4.57d) 
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possesses the first integral 



AUa + KoVo = ko, 



(4.58) 



wliere ko is an arbitrary constant. Firstly, we consider space-times of nonconstant 
scalar curvature. 



Nonconstant scalar curvature. First of all, we fix the time gauge by setting 

R = pe\ (4.59) 

where p = ±1. According to its definition the variable V^ becomes 

^/3 



Vo 



-pAeK 



(4.60) 



Differentiating this expression and making use of the canonical equations (4.57) and 
the first integral (4.58) we obtain the form of the lapse function that corresponds 
to the gauge choice (4.59), namely 



N{t) = -^^AV. 



(4.61) 



Solving the super-Hamiltonian (4.56) for Ko and performing the canonical trans- 
formation 



A 



el("-*) 



n. 



2(n„ + ^le^i("-*) 



we obtain the reduced form of the departing canonical action, namely 



S 



dt 



UnCt — H 



where the reduced Hamiltonian density is 

1 . Q.3 



H 



4A;o 



4n2-^e3" + 9pV" + fe2 



The resulting canonical system. 



n, 



a = t^IIq,, 

9/5' 3a 9/^2 2^ 

e e , 



is equivalent to the autonomous second-order differential equation^^ 

9p^3a 9p2 



" = ^^ 



U2 



(4.62) 
(4.63) 

(4.64a) 
(4.64b) 

(4.65) 



^Observe that this equation is nothing else than equation (4.40a) particularised to the isotropic 
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which can be formally solved: its general solution is given in terms of elliptic 
integrals. ^^ 

Constant scalar curvature. When the scalar curvature is constant, i.e. R = 4A, we 
obtain from the canonical system (4.57) (with A^ = 1) the elementary second-order 
ordinary differential equation 

A - uj'^A = 0, (4.66) 

where we have defined oj^ := -g, and the first integral 

A^ - u?J^ + 1 = 0. (4.67) 

Once again there are two distinct cases depending on whether the scalar curvature 
be positive or negative: 

^,^ fA-Uosh(^/At) ifA>0, ^ ^ 

A{t) = \ ^ ^ ^ 4.68 

[|A|-2cos(v^t) ifA<0, 

where we have rescaled the coordinates to eliminate irrelevant numerical factors. 
The former corresponds asymptotically to the de Sitter space-time. 
Remark. When the scalar curvature is zero, the general solution to the canonical 
system (4.57) (with A^ = 1) is readily found; it is explicitly A{t) = \/ c\ + C2t — t^, 
where c\ and ci are constants of integration, and contains the special solution that 
corresponds to an Einstein space filled with incoherent radiation (cf. [Sch93]). 

Einstein-de Sitter minisuperspace. The three-metric of the Einstein-de Sitter 
model is 

d/2 = A^(t)(dxVdy2 + dz2). (4.69) 

We proceed as in the case of the closed flrw model. The super-Hamiltonian is 

^=l^'Po-^K^^ (4.70) 



and the ensuing canonical system. 



A = -^N^, (4.71a) 

ko = ^NAVo, (4.71b) 

nA = -Ar(ip„2 + ^i^o^), (4.71c) 

Vo = ^N^, (4.71d) 



^We prefer not to write down the details of the resolution since this is not a very exciting task. 
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possesses two first integrals: 11^ = aA^ and ^111^ + KqVo = b, with a and b 
constant. 

When the scalar curvature is not constant we choose R = pe* and straightly 
integrate the canonical system to obtain the general solution in the form 

ds^ = ^dt' + ^, (4.72) 

(l-noe3*) (l-noe3t)3 

where Uo is a constant of integration. Therefore we see that this representation 
coincides with the isotropic limit (4.47) of the Bianchi-type I solution. 

On the other hand, if we consider spaces of constant curvature, i.e. R = 4A, then 
we obtain from the canonical system (4.71) and the super-Hamiltonian constraint 
(4.70) the elementary differential equation A/A = ztw, where lo"^ := |A, whence 
we derive two distinct solutions (up to rescaled coordinates), 

^^^ A-5e±v^* ifA>0, ^ ^ 

A{t) = { , ^ 4.73 

^Ar2cos(^^t) ifA<0, 



which correspond respectively to a de Sitter space and an anti-de Sitter space. This 
exemplifies the relevance of the /^-squared theory to the inflationary scenario: In 
the field equations the terms that come specifically from the quadratic part of the 
Lagrangian density can play the role of a cosmological constant; moreover it is well 
known that the addition of the R term in the gravitational action introduces a 
new spin-0 scalar field which may act as a natural inflaton in the early universe 
(cf. the Introduction). 

For a zero scalar curvature the canonical equations are easily integrated and we 
obtain the solution ds^ = —dt'^ + tdl'^, which was found independently by Caprasse 
et al. from a search of Groebner bases of the system of algebraic equations associ- 
ated with power-law type solutions of the fourth-order field equations [CDGM91]. 

4.3.3 Conformal Bianchi-type I model 

Canonical equations ana super-Hamiltonian 

In conformal gravity the canonical action (4.25) reduces to 

s = fdt n^/i + n±/3± + r±k± - n:k , (4.74) 

owing to the existence of the primary constraint (3.186), which translates into Vo ~ 
in terms of the new variables introduced on page 147. The explicit form of the 
super-Hamiltonian constraint depends on the Bianchi type considered; for type IX, 
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for instance, this is given by the expression (4.26) without those few terms involving 
Vo- In the action (4.74) we have not written the super-momentum constraints 
since they identically vanish for all diagonal class A models but type VIq; we also 
reiterate that the surface terms at spatial infinity arising in the canonical formalism 
come to naught when one considers space-time isometrics corresponding to spatially 
homogeneous Bianchi cosmologies (cf. the discussion on boundary terms on pages 
131-132). 

We focus on the simplest cosmological model that exhibits nontrivial degrees of 
freedom in conformal gravity, i.e. Bianchi type I — the isotropic FLRW space-times 
are conformally flat. In this case the super-Hamiltonian is 



6 



AV-K+K^ - 2V+iKl - K'i) - K±Ii± 



+ ^K,{K±V± - li,) - ^e-^f^{vl+V^_). (4.75) 

(The coupling constant a of the Weyl-squared term has been set equal to one.) 

We apply the Dirac-Bergmann consistency algorithm to this constrained sys- 
tem. The Dirac Hamiltonian density is Sjxi = N'}{ + XoVo, with a Lagrange multi- 
plier Ao- Conservation of the primary constraint "Pq ~ brings forth the secondary 
constraint Xo ~ 0, viz. 

Po « — ^ xo = n^ - K±P± « 0. 

Since we have the zero Poisson bracket {"PcXo} ~ the above primary and sec- 
ondary constraints are first class (the secondary one is the generator of conformal 
transformations). To proceed we must impose a gauge- fixing condition associated 
with the 'conformal constraint' Xo ~ 0. We introduce the coordinate condition 
/U ~ as an additional (primary) constraint — this amounts to turning the confor- 
mal constraint into second class. Conservation of this additional constraint yields 
the secondary constraint Kq ~ 0. Therefore we end up with a set of four con- 
straints, which are obviously second class. This enables one to eliminate the pairs 
of canonical variables {;U,n^} and {Ka.,Vo]. The canonical action (4.74) reduces 
to 



S = fdt n±/3± + V±k± - % , (4.76) 



where, in accordance with the conformal invariance of the theory and the form of 
the Bianchi metrics (4.24), we have parameterised the lapse function as iV = e^; 
hence the gauge fixing described above amounts to choosing a specific conformal 
factor for the type I metric. The super-Hamiltonian constraint (4.75) now reduces 
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to 



-^ 



AV-K+K- - 2V+(kI -K'i) - K±U^ 



\{Vl+Vl). (4.77) 



Varying the action (4.76) with respect to the canonical variables and conjugate 
momenta we obtain the canonical equations of Bianchi type I in conformal gravity: 



/3± = -^K±, (4.78a) 

ri± = 0, (4.78b) 

k+ = ^{Kl-Kl)-V^, (4.78c) 

k^ = '^^K+K. - V-, (4.78d) 

/fi 
V+ = — (n+ - 4K^V- + 4K+T'+) , (4.78e) 

6 

\/6 
V- = -^ (n_ - AK_V+ - AK+V-). (4.78f) 

6 

This system supplemented with the super-Hamiltonian constraint (4.77) is equiv- 
alent to the Bach equations Bab = 0. (The components of the Bach tensor Bab are 
written down in the Appendix at the end of this section on page 175 ff.) We have 
indeed explicitly checked that: (i) the canonical equations (4.78b) are in fact the 
(spatial) Euler-Lagrange equations, i.e. Ba = for i = 1,2,3, derived from the 
conformal quadratic action, owing to the recursion relations for the momenta II-i- 
that are obtained from equations (4.78e) and (4.78f);^^ (ii) the super-Hamiltonian 
constraint (4.77) coincides exactly with the 'time-time' component of the Bach 
tensor, i.e. Bqq = 0. 

Having at our disposal — instead of the Bach equations — the nice differential 
system (4.78) and the algebraic constraint (4.77) we intend to seek out exact solu- 
tions by performing a global involution algorithm on appropriate extra constraints 
that yield closed constraint algebras. 

Crlobal involution algorithm 

The involution method consists in applying the Dirac-Bergmann consistency algo- 
rithm to our classical system, with the Poisson bracket defined with respect to the 
canonical variables (3±, Il±, K±, 'P±, and after suitable conditions, i.e. additional 



*This equivalence is best understood with a glance at the box on page 76. 
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ad hoc constraints, have been imposed. ^^ Strictly speaking, the steps of the global 
involution algorithm are: 

1. Impose an appropriate extra constraint on the canonical variables. 

2. Require that this constraint be preserved when time evolution is considered. 
This gives rise to secondary constraints and possibly to the determination of 
the Lagrange multiplier associated with the extra constraint. 

3. Repeat Step (2) until no new information comes out. 

Once the involution algorithm has been performed we may classify all the con- 
straints into first class and second class and proceed further to the analysis of the 
particular system. 

Constraint ipk ^ 0. The first extra constraint we consider expresses that the ratio 
of the variables K± be constant, namely 

^(^°) = K_ - kK+ ^0, keR. (4.79) 

The Poisson bracket of ipj. and J{ yields the secondary constraint 

^« = ^k{k^ - 3)Kl + (P„ - kV+) « 0. (4.80) 

The Poisson bracket of (fl. and the Dirac Hamiltonian density 'Hxi '■= 3i + 
Xk^k leads to the determination of the Lagrange multiplier, viz. A^ ~ \/6(n_ — 
A;n_|_)/[6(1 + k'^)], which turns out to be constant, and the algorithm stops. Both 
constraints (4.79) and (4.80) are second class since their Poisson bracket is equal 
to 1 + A;^. Thus we can eliminate the corresponding spurious degrees of freedom. 
To this end we perform the canonical transformation 

V, -^ ^ZJ^. V. -^ ^^±±^^ (4.81) 

l + k^ 1 + P ^ ^ 

The action (4.76) then reduces to 

S = I At n±/3± + V+k+ - %' , (4.82) 

where the super-Hamiltonian (4.77) is now given by 

fc^(fc^-3)^ 4 V6{3k^-l) , 
^ = -WTW ^ 3(1 + A:2) ^+^+ 

-^(n, + m_)i.,-^^. (4.83) 

^^For a rigorous treatment on the concept of involution as applied to constrained systems, we 
refer the interested reader to Seiler and Tucker's article [ST95] (see also [Ger96]). 
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Varying the action (4.82) with respect to the pair of canonical variables K^ and 
V-^. we obtain the canonical equations 



K^ 
V^ 



\/6(3A;2 



11 



3(l + fc2) 



-Ki 



v. 



1 + A:2' 



4A;2(A;2-3)%^3 , 2^6(3^2 



3(1 + A:2)2 



-Ki + 



3(l + fe2) 



-K~V. 






n+ + m_). 



(4.84a) 

(4.84b) 



Taking into account the explicit form of the super-Hamiltonian (4.83) we obtain a 
nonlinear first-order differential equation for K^{t), 



Ki 



2, . 4 \/6(n+ + A:n_) 

^1 + k^)KX - \,T ... ' K^ 



V ' ^ '"^ 3(1 + A;2) 

which can be written as one binomial equation of Briot and Bouquet, 

• 2 4 S 

U = U + 7 U, 



(4.85) 



(4.86) 



in terms of a new variable u defined by the homography u = ib-^i^+VT+A?^, and 
with 7^ = =F|(n+ + A;n_)(l + k"^)^^'"^ . Direct calculation shows that its general 
solution is given by 

«W = tL' (4-87) 



where W stands for the Weierstrass elliptic function p(t — tg; 92,93), with invariants 
92 = and gs = — ^(n_|_ + fcn_) /(I + k"^), and with one arbitrary constant to 
(cf. [AS65, p. 627 ff.]). In terms of the canonical pair (K+jT^^) the representation 
(4.87) corresponds to the expressions 



K+it) 

vAt) 

where 



V6(n+ + m_) 1 

12(1 + A:2) vv' 

\/6(n+ + m_) 



72W2 



(1 + fc2) 



3^2 

'n+ + m_) +6W 





(4.88a) 


dlnW" 
dt 


(4.88b) 



w 



dlnW 
dt 



±-1/144^3 + 



1 + A;2 



To obtain the analytic form of the anisotropic metric functions f3±{t) we must 
integrate the right-hand side of equation (4.88a). We apply the following result 
[AS65, p. 641].26 

^^A prime denotes differentiation with respect to z, the argument of the Weierstrassian functions 
considered hereafter. 
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Proposition 4.3.1. If p'{zo) / 0, then 
dz 



P[zo) 



2zC{zo) + In cr(z - zq) - In a{z + zq), 



p{z) - p(zo) 

with Weierstrassian functions C{z) and a{z) defined by (' {z) + p{z) = and cr'{z) — 
a{z)({z) = respectively. 

To achieve our aim we take z = t and zq = tz, wiiere t^ is a zero of the 
Weierstrass elliptic function, i.e. p(tz) = 0. Indeed, we obtain 



dt 



■ [2t C(tz) + In a{t - tz) - In a{t + t^ 



(4.89) 



w{t) n+ + m_ 

and we can write the corresponding homogeneous metrics of type I under the form 

I 1+V3k 



ds' 



-dt^ + exp 



+ exp 



+ exp 



'2{l-VSk) 



VTT¥ 

4 



iC(iz 




" \/l+fe2 



dx^ 



^ l-\/3fc 



dy2 



VTT¥ 



tC{t. 



crjt - tz 

(T{t + tz 



Vi+P 



dz^. 



(4.90) 



As a particular case of the present analysis we can specialise the above solution 
(4.87) to the axisymmetric case, for which the secondary constraint (4.80) reduces 
to the conditions 



^^k^'^, k€{0,±V3}. 

A_|. y+ 



(4.91) 



According to the canonical transformation (4.81) the new variable V- vanishes 
automatically. The super-Hamiltonian (4.83) becomes 



Ml. = ^B^^KlV. 



3(l + fe2) 

and the corresponding canonical equations are 
. _ VGjSe - 1) 2 ^+ 

+ 3(l + fe2) ^+ 1 + ^2' 

^ V6(3A;^-2) V6, 



T("-+^"-)^-2(iTPy 



n+ + m_). 



(4.92) 



(4.93a) 



(4.93b) 



3(1 + A:2) -'■ ' 6 

As in the general case we find out the binomial equation of Briot and Bouquet 
(4.85), the solution (4.87) of which is valid for any value of the parameter k; hence 
the axisymmetric solution is obtained by setting A; to or ib\/3 in expressions 
(4.88) and finally in the metric (4.90). 
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Constraint (fp ^ 0. Consider the extra constraint expressing that the ratio of the 
variables T'± be constant, namely 

V^ - pV+ « 0, p G M. (4.94) 



V^f 



The Poisson bracket of cpp and !Kd := !H + Xp(pp yields the secondary constraint 



^« 



4P+ [K^{p^ - 1) - 2pK+] + (n_ - pU+) « 0. (4.95) 

To check the consistency of the involution algorithm we consider two distinct cases: 

1. The secondary constraint (4.95) is identically satisfied if n_ — pU-^- « and 
K^{p'^ — 1) — 2pi^+ « 0. Again, in order to proceed, we split up the analysis 
into two subdivisions: 

(a) If p / (T, with a = ±1, the following weak equality holds: K^ « 
2pK+/{p^ — \). The Poisson bracket of (/Jp and 3iD yields the constraint 

■3/)Ap-p(/_3)P+ 
2^/6p(l - 3p2)(j32 _ 3) 



V^f 



:i 



3(p2 



1) 



-Ki ^ 0, 



(4.96) 



where p 7^ -^ since we consider nonzero canonical variables V±. Con- 
sistency then leads to the determination of the Lagrange multiplier A- 



P' 



VIZ. 



A. 



2^/6p(p2 



'\kI 



Pip" - 3) 



1 



V4 



3(p2 _ 1)2 + 3p2 

When p G {O, (7\/3} the Poisson bracket of Lpp and ^^d vanishes iden- 
tically and the involution algorithm does not generate any new con- 
straints. This subcase corresponds to a particular case of the axisym- 
metric solution with k = p and n_ ss ^11+ (cf. equation (4.91)). For the 
nonaxisymmetric case, on the other hand, the next step in the involution 
algorithm yields 



n. 



^^/6(3p2-l)(p2 + i) 



3(p+l)3(p 

and the last step gives 



1)3 



Ki 



4{p^ + 1) 

p2 — 1 



Va 



K, 



V6(l-3p2)(a-3)_^2 



6(p2 - 1)2 

At this stage no more constraints arise. Taking into account that the 
super-Hamiltonian (4.77) is weakly vanishing we find out that 11+ must 
be equal to zero. This subcase will be discussed below, as part of our 
discussion on the generic case with Il-i- ss 0. 
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(b) li p = a, then K^ is weakly vanishing. The Poisson bracket of ipp and 
JCl) yields the constraint 

V.f = A, - an + ^Kl « (4.97) 

and consistency determines the Lagrange multiplier Xp. The Poisson 

(2) 

bracket of ipp and ^d gives the weak equality 

(T O 

The last step in the involution algorithm yields 

At this stage no more constraints arise. The vanishing of the super- 
Hamiltonian (4.77) restricts 11+ to be zero. As in the previous nonaxi- 
symmetric case with p / o", this subcase will be discussed below. 

2. When n_ —pTl+ / 0, the Poisson bracket of ipp and !Kd yields the constraint 

(^(,2) = ^p(^p2 _ 3)p2 _ ^(^ _ :ip2)XpV+ 

- AV+ {K+ + pK^) {pK+ - K^) 

+ n_ {K+ - pK^) + n+(pK+ + K^) « 0. 

(5) 

lip = -y=, the involution algorithm closes when ipp is computed, but it does 
not lead to any exact solutions since the super-Hamiltonian is not compatible 
with the constraints (pp for j = 1, . . . , 5. On the other hand, if p / -^, we 
have not been able to close the algorithm. It turns out, however, that the 
involution algorithm in this case is useless since the system under considera- 
tion does not produce an integration case, as it can be shown by a local study 
of its analytic structure [DQS98].2^ 

Constraints Il± « 0. Consider now the extra constraints expressing that the 
canonical variables II-i- be equal to zero. Contrary to the involution of the previ- 
ous constraints ipk and fp, the consistency algorithm here is trivial: it is unable 
to produce any new information since there are no secondary constraints. The 
constraints II-i- f» are first class; we can choose their corresponding Lagrange 
multipliers X± to be zero. In that case the constraints remain weak equations to 
be imposed on the physical states of the system. Hence, at the classical level, the 

^'^ Strictly speaking, the system with the constraint (4.94) does not possess the Painleve property. 
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appropriate system of canonical equations is the system (4.78), where we set Il-i- 
to zero. Besides equation (4.78a), the relevant equations are thus 

k^ = ^{K^_-Kl)-V^, (4.98a) 

k_ = ^^K+K. - v., (4.98b) 

P+ = ^{K+V+ - K^V^), (4.98c) 

■p_ = _^(K_p^ + K+V^). (4.98d) 



The super-Hamiltonian (4.77) is now given by 



^ V6r 



2V-K+K^ - V+iKl - Kl^ 



-{Vl + Vl). (4.99) 



The general solution of equations (4.98) is easy to produce under closed analytic 
form. We first solve equations (4.98a) and (4.98b) for V^ and V- respectively and 
write down second-order equations for K+ and K_, namely 

m± = 4Si^±, with T.:=Kl + K'i. (4.100) 

Aside from the super-Hamiltonian constraint, the canonical system (4.98) possesses 
the first integral 

K+k^ - K^k+ = S, (4.101) 

where S is an arbitrary constant. This first integral has been found independently 
from a direct analysis of the Bianchi-type I field equations in conformal gravity 
[Rem98]. Making use of the super-Hamiltonian constraint (4.99) we produce a 
scalar second-order equation for S, namely S = 4S^. We integrate this equation 
and obtain the solution 2S = 3p{t — to; 0,53), with arbitrary constants to and 
gs = -g-^^- This reduces the system (4.100) to linear differential equations of the 
Lame type, i.e. 

K± = 2p(t-to;0,<73)i^±, 

which are studied exhaustively in [Inc44, WW50]. If we adopt Ince's notations, 
our particular case of the Lame equation is specified by /i = and n = l?^ If we 
introduce tz such that the transcendental equation p(tz) = is satisfied, i.e. tz is 

^^The general solution to the system (4.98) is uniform; this confirms the single-valuedness of a 
possible integrability case detected by the Painleve test: the complete system with n± — has 
the Painleve property [DQS98]. 
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a zero of the Weierstrass elliptic function, then the general solution to the system 
under study is given by the fundamental set 

i^±,i = exp[-tC(tz)]^^^^, (4.102a) 

K±,2 = exp[+taQ]^^^-^, (4.102b) 

with Weierstrassian functions C(t) and a{t) defined by ((t) + p{t) = and &{t) — 
a{t)({t) = respectively. The solutions given by the fundamental set (4.102) are 
distinct, provided that the parameters e, for i = 1, 2,3, which are defined by 

61 + 62 + 63 = 0, 4(6263 + 6361 + 6162) = -52 = 0, 4616263=5(3, 

are not equal to zero; this is indeed the case here whenever gs / 0. If 53 = = 6, 
then 61 = 62 = 63 = and the solutions in the fundamental set are one and the 



2Q 

same: 



Ve sin AK 
K+ = ±^73^' (4-103a) 

v/G cos AK 



^/6 1±sin4i^-2sin2 4K 
1 (t - to)' 



n = ^ 7^-TT2 ' (4-103C) 



V6cos4i^(l±2sin4K) 

with K constant. Denoting x '■= i~io and integrating equations (4.78a) we obtain 
type I homogeneous metrics under the form 



ds2 = -dx^ + V X^"' {dx'f, (4.104) 



where the parameters fj for i = 1,2,3 satisfy the relations 

3 3 

2 



^Ui = 0, Y^K 



2_3 



j=l i=l 



^^This case is better understood in the light of its analytic structure: it turns out that one of 
the relevant singularity families of our general canonical system becomes an exact two-parameter 
particular solution when n± vanish [DQS98]. 
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Analytic structure oi the Biancni-type I system 

The Painleve strategy. The existence of the above solutions, whether particu- 
lar solutions of the general differential system or general solutions of specialised 
systems, tells nothing about the integrability or non-integrability of the complete 
system and gives no information whatsoever about the mere accessibility of an 
exact and closed-form analytic expression of its general solution. This is due to 
the fact that the global involution algorithm of the extra constraints, as operated 
above, is not related with integrability and may even prove to be nonexhaustive. 
The integrability issue is tackled through an invariant investigation method of in- 
trinsic properties of the general solution. In particular, the result does not depend 
on specific choices of the metric, within some well-defined equivalence class. 

The approach advocated by Painleve proceeds from the main observation, nowa- 
days frequently exemplified in various areas of theoretical physics, that all ana- 
lytic solutions encountered are single-valued or multiple-valued finite expressions 
(possibly intricate) depending on a finite number of functions — solutions of linear 
equations, elliptic functions, and the six transcendental functions systematically 
extracted by Painleve and Gambier. Therefore, the building blocks of this pro- 
cess are the functions, which are explicitly defined through their single- valuedness. 
This emphasises the relevance of the analytic structure of the solution, as uni- 
formisability ensures adaptability to all possible sets of initial conditions. The 
next step of this approach deals with integrability in some fundamental sense. In- 
deed, probing the analytic structure of a system requires a global, as opposed to 
local, integrability-related property, namely the Painleve property.^^ At this level, 
integrable systems are defined in the sense of Painleve as those systems that pos- 
sess the Painleve property.^^ Moreover, in keeping with the above observation, 
the requirement for global single-valuedness ought to be relaxed to accommodate 
this definition to integrability in the practical sense. Upon using broader classes of 
transformations, the (unavoidable) analytic structure of the solution is easily kept 
under control and either allows or rules out the possibility to produce a mode of 
representation of the general integral. 

Such an approach requires the analytic continuation of the solution in the 
complex domain of the independent variable and computes generic behaviours of 
the solution in a vicinity of each movable singularity. The trivial part of the 



^^ A differential system possesses this property if, and only if, its general solution is uniformisable 
or, equivalently, exhibits no movable critical singularity. 

^^The Painleve property is invariant under arbitrary holomorphic transformations of the inde- 
pendent variable and arbitrary homographic transformations of dependent variables. In particu- 
lar, the results given in greater detail in [DQS98] are invariant under arbitrary analytic reparam- 
eterisations of time. 
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Painleve method, known as the Painleve test, produces necessary but not sufficient 
conditions for a system to enjoy the Painleve property and requires local single- 
valuedness of the general solution in a vicinity of all possible families of movable 
singularities.^^ 

Analytic proof of non-integrability. In [DQS98] we have shown that the complete 
differential system (4.78) admits three distinct local leading behaviours in some 
vicinity of movable singularities, that is, three distinct singularity families. The 
first leading behaviour, denoted /(i), is valid in a vicinity of the movable point 
t = ti and exists only when n+n_ / 0. The existence of the second singularity 
family, referred to as /(2),±5 valid in some vicinity of the movable point t = t2, 
requires n_(2\/3n_ it 311+) / 0. The third and last leading behaviour, denoted 
/(3),±5 is valid in some vicinity of the movable point t = t^ and is associated 
with another arbitrary constant parameter, denoted K in [DQS98]; moreover it 
exists only when AK {0,7r, -tt, |, -|, =Ff , ^F^}- ^he super-Hamiltonian (4.77) 
introduces no new restrictions since, at leading order in x '■= t — ti for all i = 1,2, 3, 
it always vanishes in some vicinity of movable singularities around which leading 
behaviours /(i), /(2),±, and /(3)^-t hold. 

Around each species of movable singularities one must inquire whether it is 
possible to generate single-valued generic local representations of the general so- 
lution. The Fuchsian indices have been computed for each leading behaviour and 
all preliminary conditions have proved to be fulfilled. However, near both movable 
points t2 and t^ in C, it is not possible to build single- valued generic local expan- 
sions, unless some constraints are imposed. Upon introducing movable logarithmic 
terms in these local series one may regain genericness — at the expense of losing the 
Painleve property. ^^ 

Such multiple- valuedness, betrayed by local investigations, also pertains to the 
general solution itself. This analytically proves that the system under consideration 
is not integrable: Its general solution exhibits, in complex time, an infinite number 
of logarithmic transcendental essential movable singularities; in other words, an an- 
alytic structure not compatible with integrability in the practical sense — the quest 
for generic, exact and closed-form analytic expressions of the solution is hopeless. 
This result holds under space-time transformations within the equivalence class of 
the Painleve property; see [Con99]. Yet, local information produced by the Painleve 



^^ Movable essential singularities are difficult to handle, since one lacks methods to write down 
conditions under which they are indeed noncritical. For a tutorial presentation of basic ideas and 
constructive algorithms aimed at the generation of integrability conditions, we refer the interested 
reader to [Con99, RGB89]. 

^^For a thorough account and a complete proof of these claims, cf. [DQS98, Appendix B]. 
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test may still be used in order to extract all particular systems that may prove to 
be integrable. In the case under study, it turns out that all particular solutions are 
meromorphic, but this is only a result of the application of the method. 

All integrable particular cases. Even though this does not provide a proof but 
merely an indication of uniformisability, the Painleve test requires local single- 
valuedness around all possible species of movable singularities. Utilising the results 
obtained from the Painleve analysis [DQS98, Appendix B] and comparing them 
with those derived from the global involution algorithm above we draw the following 
conclusions: 

• Near the movable point t = ^2, the complete set of integrability conditions 
is satisfied if one imposes n„ = ib\/3n+ and simultaneously requires the 
vanishing of the arbitrary constant parameter associated with the index j = 
—3. On the other hand, if one sets 2\/3n_ = =F3n_|_ or n_ = 0, the family 
/(2),± dies out and ipso facto no restriction needs fulfilment. 

• Near the movable point t = t^, single- valuedness may only be recovered in two 
particular cases. The first case deals with a local representation of the general 
solution of that specialised system obtained with Il-j- = 0. In this case local 
leading behaviours /(i) and /(2),± die out whereas, near t = t^, a meromorphic 
local generic expansion is produced. This corresponds to the closed-form 
solution (4.102). The second case deals with a local representation of some 
particular solution of the complete differential system and requires that the 
arbitrary parameter K be such that sinAK = n_|_(n^-|-n?_)~^'^ and cosAK = 

• The closed-form solution (4.90) obtained by imposing the extra constraint 
(/jfc ~ on the system corresponds to one particular integrable case, for it 
turns out that: 

1. all indices that characterise the /(3),± singularity family are compatible; 

2. the /(2),± singularity family dies out unless k G {0,ib-v/3}, but in this 
latter instance the indices are compatible. 

• The particular axisymmetric solution with "P^ = ±\/^V-^- corresponds to the 
integrable case of the f(2),± singularity family. 

• The closed- form solution (4.104) corresponds to the case when the singularity 
family /(3),± becomes an exact two-parameter solution whenever II-i- = 0. 
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These results clearly indicate that the global involution algorithm of extra con- 
straints has proven to be exhaustive in the search for exact solutions that may be 
written in closed analytical form. 

Coniormal relationsnip witn hinstein spaces 

The problem of conformal relationship between Riemannian spaces and Einstein 
spaces was first addressed circa 1920 by Brinkmann who studied the necessary and 
sufficient conditions for n-dimensional spaces to be conformally related to Einstein 
spaces [Bri24, Bri25].^^ Kozameh, Newman, and Tod reexamined this question for 
four-dimensional manifolds and obtained nice results by addressing the problem at 
one and the same time from the tensor and spinor points of view [KNT85].^^ 

In the context of conformal gravity it is crucial to determine whether closed- 
form solutions are conformally related to Einstein spaces or not: Since every Ein- 
stein space or every space conformal to an Einstein space fulfils the vacuum Bach 
equations automatically, any closed-form solution that can be mapped onto an 
Einstein space can be thought of — from the point of view of generalised gravity 
theories — as a minor solution; as a matter of fact the interesting solutions will be 
those that are not conformal to an Einstein space. ^^ 

In the case under study, i.e. Bianchi-type I cosmology, despite the fact that the 
general system is not integrable, as was proven by the Painleve analysis, we have 
obtained all the particular exact solutions that may be written in closed analytical 
form. In keeping with the remark above, it is natural to discuss the conformal 
relationship of those solutions with Einstein spaces [Que98]. 

A four- dimensional space-time (M, g) can be mapped onto an Einstein space 
under a conformal transformation gab = e^"^^' gah if and only if there exists a 
smooth function o-(x) that satisfies 

Lab = VaaVba - VaVba - -gabO"'^'^ c<j'^ d(r - -e^" Qab, (4.105) 

2 

where the tensor Lab is defined by Lab '■= -^{Rdab — QRab) and A := ^R denotes 
the cosmological constant that characterises the conformal Einstein space (M, g) 
[Eis66]. The first integrability conditions of equation (4.105) are given by 

VdCi,, + Ci,,Vda = 0. (4.106) 



^■^ Standard results can be found in Schouten's and Eisenhart's textbooks [Sch54, Eis66]. 

^^The very first who undertook this problem within the framework of the spinor formalism was 
Szekeres [Sze63]; however, the necessary and sufficient conditions he found are extremely difficult 
to translate into tensorial expressions. By contrast, Kozameh, Newman, and Tod's analysis yields 
simpler results. 

^^These are also called 'nontrivial' solutions (according to Schmidt's terminology [Sch84]). 
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These are the necessary and sufficient conditions for a space to be conformally re- 
lated to a 'C-space', i.e. V^^L^a = 0,^^ [Sze63]. The second integrability conditions 
of equation (4.105) are 

Bab := 2V^V'^C,,6rf + C^mR"" = 0. (4.107) 

Therefore, fulfilment of the Bach equations is a necessary condition for a space to 
be conformally related to an Einstein space. If considered separately the above 
integrability conditions of equation (4.105) are merely necessary conditions with 
regard to the conformal relationship with Einstein spaces. However, Kozameh, 
Newman, and Tod have proven that they constitute a set of sufficient conditions 
as well [KNT85]: 

Theorem 4.3.2. A space-time (M, 5) is conformally related to an Einstein space 
(M, ^) if and only if equations (4.106) and (4.107) are fuMUed. 

Specifying equation (4.105) for Bianchi type I we obtain the differential system 
2a-&^ + 5(/32 + /?2 ) _ Ae2a ^ q^ 

a^ + 2&{i3+ + \/3/3_) + /3+ + ^3(3. - 
a^ + 2ct(/3+ - \/3/3_) - /3+ - \/3/3_ - 
d2-4a/3+-2/3+-/32 -/?2 _|e2- 

which readily integrates to yield the necessary and sufficient condition for a Bianchi- 
type I space to be conformally related to an Einstein space, namely 

K± = -V6f3± = -V6k±e-'^'', k± / 0, (4.109) 

and provides one with a differential equation that enables one to determine explic- 
itly the conformal factor x := 6^°^, viz. 

x^ = -Ax^ + 4{kl + k'i). (4.110) 

This last equation can be written as the Weierstrass ordinary differential equation 
in terms of the Weierstrass elliptic function W := p{t — to; 52, 53), with invariants 
§2 = and gs = — |A^(A;^ + k'^). Therefore, the conformal factor that brings a 
Bianchi-type I space onto an Einstein space is given by 

g2a(t) ^ 3A-i>v. (4.111) 









(4.108a) 


Pl- 


-/3^ 


-fe^-O, 


(4.108b) 


(il- 


-/5^ 


-fe^-O, 


(4.108c) 


= 0, 






(4.108d) 



^'^There is a natural hierarchy of classes of Riemannian spaces of which the most general class 
consists of those spaces in which the Bianchi identities take the form '^ dC^abc ~ or, equivalently, 
V[cLi,]a = 0; these are called 'C-spaces' and contain the Einstein spaces as a subclass [Sze63]. 
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Taking this last result into account we get from equation (4.109) the explicit form 
of the functions K±{t), that is 



Ki{t) = -^AkiW-\ 



(4.112) 



So far we have not used the Bach equations nor the equivalent canonical system 
(4.78). If we do so, we see that the expression (4.112) does actually coincide with 
the solution (4.88a) upon identifying 



K4- 



A 



A;+n+ + A;„n_ 



(4.113) 



Thus we can rewrite type I homogeneous metrics (4.90) under the equivalent form 
ds = —dt + exp 



±^(^±±^t^K(t. 



exp 



+ exp 



'kl + kl 
2{k+ - Vsk^ 




(4.114) 



This is in agreement with the fact that imposing the constraint (4.79) — here, a 
direct consequence of equation (4.109) — on the Bach equations is equivalent to 
requiring the condition (4.106) to be fulfilled, as it can be proved with the help 
of Reduce. In other words, assuming that the ratio of variables K± be constant 
is a necessary condition for a Bianchi-type I space to be conformally related to 
an Einstein space; in conformal gravity, this becomes a sufficient condition as 
well. Thus the only way to find out a solution to the Bach equations that is 
not conformally related to an Einstein space is to relax the constraint (4.79). In 
accordance with our analysis of the preceding sections, we have indeed obtained 
the only closed-form analytical solution that cannot be mapped onto an Einstein 
space, namely the general solution (4.102) to the canonical system (4.98), thereby 
confirming Schmidt's conjecture on the existence of such solutions [Sch84, Que98]. 
Now examine the particular case of a vanishing cosmological constant. Equation 
(4.110) becomes trivial and yields (up to an irrelevant constant of integration) 



,Mt) 



t-to=: X- 



(4.115) 
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Inserting this result into equation (4.109) and integrating we obtain type I homo- 
geneous metrics under the form 

which coincides with the exact two-parameter particular solution (4.104) in the 
case of zero ll-i-, upon identifying /c_|_ = ^^ sin4i^ and fc_ = — ^ cos4i^. Perform- 
ing a conformal transformation with conformal factor (4.115) and introducing the 
proper time r := x^ we derive the metric (4.116) that characterises the conformal 
Einstein space with zero cosmological constant, namely 



ds^ = -dr^ + ^T2P'(dx^)^ (4.117) 



where the parameters pi for i = 1, 2, 3 satisfy the relations 

m+P2+P3 = l, pl+pl+pl = ^[l + Kkl + k'i)\=l. (4.118) 

Hence, as expected, we recover the Bianchi-type I Kasner solution of vacuum gen- 
eral relativity. 



Appendix: Bach tensor 

We have computed the components of the Bach tensor (2.21) for the Bianchi-type I 
metric with Misner's parameterisation, utilising the EXCALC package in Reduce; 
they are explicitly: 

Soo = 2/3'_/3_ -'I3l- 12/?1 - 24/32/32 + 2i3j+ - '$1 - 12/3^, 
Bii = \/3C/3'_ - 24/3_/32 _ S/?./?^ _ l6/3_/3+/3+) - 2'/3'_/3_ + '(31 

- 16/3_/3_/3+ + 12/31 - 8/?2 /3+ + 24/3^ /j2 + -^'^ 

-2'/3+/3+ + ^2 _24^^^2^12/3t, 
^22 = V^(P- - 2Aj3.fii - 8/3_/32 - 16/3_^+/3+) + 2'/3'„/?_ - '[31 

+ 16/3_/3_/3+ - 12/31 + 8/?^/3+ - 24/3^/32 _ "^'^ 

+ 2'/3+/3+-^2^24/3+/32 -12/3^, 
^33 = 2/3'_/3_ - /32 + 32/3_/3_/3+ - 12/31 - 16/3^/3+ - 24/3!/?2 

+ 2'/3 + + 2'/3'+/3+ -'[3l- 48/3+/32 - 12/3^. 
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"When you have read these hastily 
scrawled pages you may guess, though 
never fully realize, why it is that I must 
have forgetfulness or death. " 

— H. P. Lovecraft, "Dagon" 



